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Preface 


A few years ago, our department started a new undergraduate degree program in Biomedical Physics. The 
prerequisites to enter this program included two semesters of physics and two semesters of calculus, both 
differential and integral. A group of participating educators (including some members from the departments of 
physics, mathematics, and biology as well as from the School of Medicine) developed the courses and their 
contents for this new program. For one of these courses, Mathematics for Biomedical Physics, all the required 
topics were not covered in any single textbook available. As the instructor of this course, | had to prepare my 
own personal notes for some of the topics that | shared with students. | felt that it would be great if a single 
textbook can be produced which covered all the mathematical topics that were needed for the biomedical 
physics program. | wanted to make this textbook freely available to all students everywhere using internet. 
Access to education is a basic human right which should not be hindered by any lack of money or dearth of 
educational resources. With this thought in mind, | started turning the course materials and lectures that | had 
been using in my course into an open educational resource (OER). After transcribing all the notes in Word, | gave 
the notes away to students in my next class, seeking their feedback and comments. | have carefully included 


responses to these comments in the book. 


Even though the prerequisites for this textbook are two semester-long courses in calculus, which cover functions 
of single variables, the first two chapters of this textbook are devoted to differential calculus and integral 
calculus. These chapters lead the reader from calculus of functions of a single variable to calculus of multivariable 
functions. In this scheme, the ideas related to partial derivatives as well as multiple integrals are revealed quite 
naturally. Throughout the textbook | have attempted to start with something that a typical student may be 
familiar with and end up with something that will be entirely new for the reader. | would appreciate receiving (at 
wadehra@wayne.edu) constructive feedback, from students and faculty as well as other readers, who are using 


this book as a whole or in parts. 


The textbook is geared to introduce several mathematical topics at the rudimentary level so that students can 
appreciate the applications of mathematics to the interdisciplinary field of biomedical physics. Most of the topics 
are presented in their simplest but rigorous form so that students can easily understand the advanced form of 
these topics when the need arises. Several end-of-chapter problems and chapter examples relate the 
applications of mathematics to biomedical physics. After mastering the topics of this book, the students would 
be ready to embark on quantitative thinking in various topics of biology and medicine. The famous renaissance 
philosopher and astronomer, Galileo Galilei, is quoted to say, “Mathematics is the language in which Nature has 
written the book of Universe”. This textbook is an endeavor to teach the language of Nature in a careful, yet 


simple, manner to undergraduate students. 


Chapter 1: Differential Calculus 


We will start with a review of differential calculus, assuming that the reader has already seen derivatives for 
functions of a single variable. After a brief review of simple derivatives, we will introduce the derivatives of 


functions of multiple variables, also known as partial derivatives. 
1.1 DERIVATIVE OF A FUNCTION OF A SINGLE VARIABLE 


F(x) is a function of a single independent variable x as shown in Figure 1.1. An independent variable means that 
its value can be assigned at will, without any constraining conditions. By definition, the derivative of F(x) is < 


given by 


ge ie ee Eq. (1.1) 


dx  h-0 


F(x) 


Figure 1.1. F(x) is a function of a single independent variable x. 


As a simple example, consider the derivative of F(x) = sin (x), 


dF , sin(x + h) — sin(x) 


—= lim 
dx  h-0 h 


In the numerator, we use the trigonometric identity 


A-B A+B 
sinA —sinB = 2 sin ( 5 ) cos (=) 


to get 


aes ona az, a 
Or, 
dsin(x) _ i sin(h/2) 
agg Fe hp 


The limiting value of this expression can be easily obtained, using simple geometry and trigonometry (see 


Appendix A), to be 1. Using this limiting value, we get the result 


dsin(x) _ 
aS cos(x). 


A similar procedure can be used to determine derivatives of other well-known functions. Here is a compilation of 


derivatives of several commonly used functions: 


dC 
—=0, C isaconstant 
dx 
dx” 
dx =I 
d exp(x) 
Fe = exp x) 
din(x) 1 
dx x 
dcos(x 
os ) sin(x) 
dtan(x) 
7 ec’ x 
darcsinx _ 1 
dx V1 —x? 
darccos x - 1 
dx L=x2 
d arctan x _ 1 
dx ~ 14x? 


Higher Order Derivatives 


Assume that the derivative < of the function F(x) is represented by a new function F;, (x); that is, F(x) = < 


a 


2 
Then, “ is the second derivative of F(x), namely, ~ Similarly, if F(x) = , then 2 = is the third derivative of 


F(x), namely, <= = Using this procedure, higher derivatives of any order can be determined for a function of a 
single variable. 


Chain Rule 


When a function F of variable x can be expressed as a function of another function as F[.g(x)], then the 


derivative of F can be most conveniently evaluated using the chain rule as 


dF dF dg SAD 
dx dg dx’ q. (1.2) 


Example: Using the chain rule, determine the derivative of the function F(x) = Vx? + a?. 


Solution: To determine the derivative < first define a new function g(x) = x? + a?. Then, F(x) can be written 


as a function of g(x) as F(x) = ./g(x) = g/2. Using chain rule, 


dF dFdg 1 4/2 x 
—=>>=59 (2x) = ———. 
dx dgdx 2 Vx2 4+ qe 


Product and Quotient Rules 


When a function F of variable x can be expressed as a product of two simpler functions as F(x) = f(x) g(x), 


then the derivative of F (x) is given by 


a — df (x) 


== fe) — 


g(x). Eq.(1.3) 


In words, if F(x) = (FIRST)( SECOND), then 


d (SECOND) | d (FIRST) 


= (FIRST 
( ) dx dx 


(SECOND). 


Example: Using the product rule, determine the derivative of F(x) = x*Inx. 
Solution: Derivative of F(x) using the product rule is 


dF 2dinx dx? _ f4 44 err 
am. * Ag nx T= x : nx (2x) = x+2xInx. 


A oe case of the product rule is the quotient rule, which is used when the function F (x) can be expressed 


as F(x) = ae In this case, 
| df dg eile 
am pee) ae fC) a]: q. (1.4) 


Example: Using the quotient rule, determine the derivative of F(x) = exp(ax) /x. 
Solution: The derivative of F(x) using the quotient rule is 


dF 1 d exp (ax) dx] a 1 ( 7 ax—1 
ao el ae exp(ax) mal =~ oxP(ax) xz oP ax) = exp(ax) 


Interpretations of a Derivative 


The derivative < of the function F(x) has two distinct interpretations. 


dF (x) 


First, the value of a derivative, , atx = ais the slope of the line that is tangent to the function F(x) atx = 


a. Now, since the slope of a function at a point where the function has its minimum or maximum value is zero, it 
follows that the derivative of the function at its extremum points (that is, points with minimum or maximum 
values) is also zero. So, the extremum (minimum or maximum) points of a function can be determined by setting 
dF (x) 
dx 


its derivative equal to zero. The equation —— = 0 can be solved to obtain the values of x = Xj, where the 


. 2 fei . . 7 ihre sl dF (x 
function is minimum or xX = X mq, where the function is maximum. In the vicinity of x,,;,, the values of <2 are 
dx 
negative for x < Xin, zero for X = Xmin, and positive for xX > Xmin. In other words, <> <* is positive at X = Xmin- 


Similarly, in the vicinity of x,.¢,, the values of ~™ are positive for x < Xmax, zero for X = Xa, and negative for 


X > Xmax- In other words, = “is negative at X = Xmax- 


Example: As an example of this first interpretation of a derivative, let us look at the case of a young mom, 
sitting on a bench on a concrete patio, watching her small child playing on the grass nearby, as shown in the 
Figure 1.2. The lengths d,, and d, are the shortest distances of mom and child, respectively, from the patio- 
grass boundary. Mom can run on the patio with speed v, and on the grass with speed v,. When an emergency 
arises, the mom would like to reach the child in the shortest possible time. Which path should she take during 


an emergency? 


path 1 


Patio 


Figure 1.2. The quickest path a mom can take to reach her child in distress. 


Solution: The mom can either take path 1 or path 2 (shown in the figure) or any path in between. Consider an 
arbitrary path, as shown in the figure, taken by mom to rush to her child. Total time taken by mom using this 


path is 


_ Gee , EF Oa? 


Vp Vg 


Here L is the separation between the mom and the child along the patio-grass boundary. The distance x, shown 


in the figure, will vary depending on the specific path taken by the rushing mom. The path with the shortest time, 


according to the above interpretation of the derivative, will be the one for which < is equal to zero. Thus, 


dt 2x 2(L —x)(-1) 


— = + —— = 0, 
dx 2v,fdz+ x? 2v, fd? + (L—x)? 


or 


1 x 1 (L —x) 


Up fag+ x2 y fde+ ~L—xy 
or, in terms of the angles 6,, and @, shown in the Figure 1.2, 


sin A, _ sin 6, 


This relationship describes the path that mom should take to reach the child in the shortest possible time. It is, 
essentially, Snell’s law of refraction in optics that concerns the bending of light as it travels from one medium 
(number 1) into another medium (number 2). Mathematically, Snell’s law is stated as n, sin 8, = nz sin 8, 
where n, and n, are the indices of refraction of the two media. Light travels with different speeds in media with 
different indices of refraction with v, = c/n, and vz = c/n, (c being the speed of light in vacuum). Thus, Snell’s 
law can be expressed as sin 8, /v, = sin 0,/v,2. Stated differently, Snell’s law implies that when light travels 
from a point in the first medium to another point in the second medium, it takes a path that minimizes its time of 


travel. 


Second, the derivative ane represents the rate of variation of function F(x) with x. Suppose that when 


independent variable x changes by a small amount Ax, then the corresponding change in the value of the 


function is AF’. Using this interpretation of the derivative, 
(Change in F) = (Rate of change of F with x) : (Change in x) 


or, 
AF) = — (A Eq. (1.5 
( ) 1 ¢ x). Gat ) 


The idea about the rate of change of a quantity appears in many diverse areas of knowledge. For example, in 
physiology we talk about the rate at which blood flows in veins; in geography we talk about the rate at which 
population grows in a certain area; in meteorology we talk about the rate at which pressure varies with altitude; 
in medicine we talk about the rate at which a cancerous tumor grows or a contagious virus spreads; in sociology 
we talk about the rate at which a news or some rumor spreads; in psychology we talk about the rate at which 
different people learn certain skills (that is, the learning curve); in physics we talk about the rate at which the 
velocity of an accelerating automobile changes, etc. No matter in which context we talk about the rate of change 


of a quantity, its mathematical description would always be presented in the form of a derivative. 


1.2 FUNCTIONS OF MULTIPLE VARIABLES: PARTIAL DERIVATIVES 


Recall that for an independent variable, any value can be assigned to it without any constraining conditions. If F 
is a function of two or more independent variables, then the derivative of F with respect to one of the variables, 
while holding the other variable fixed, is called the partial derivative. Formally, if F(x, y) is a function of two 


independent variables x and y, then 


OF (x,y) 2 ge ee) — F(x, y) 


Ax lim i Eq. (1.6a) 
ORGY) Vee hay ER By) 
es er ene Eq. (1.6b) 


, ware a a ‘ ees ae 
are the partial derivatives of F. Note the use of m4 and a for partial derivatives versus = for derivatives of a 
: ' . ; es : ae : a 
function of only one independent variable. A common notation for writing partial derivatives also includes (=) 


and (+) in which the fixed variable is written as a subscript. 
x 


Example: Determine the partial derivatives of the function F(x, y) = x? —xy? + y. 


Solution: Since x and y are two independent variables, the partial derivatives of F(x, y) with respect to x and y 


are: 


OF (x,y) _ 


3 Zi 2. 
Ox . y 
OF (x,y) 
——— = -2 1. 
ay xy + 


As before, < (or ) represents the rate of variation of F with x (or y). If both x and y vary independently, then 


(Change in F) = (Rate of change of F with x) - (Change in x) + (Rate of change of F with y) 
‘(Change in y), 


or 


OF OF 
(AF) = ae (Ax) + ay (Ay). Eq. (1.7) 


Higher Order Partial Derivatives: Clairaut’s Theorem 


OF OF OF, a°F OF, a°F OF, a°F OF. a°F be fo 
IlfE, =— and F, = — then— = —, = _= and — = — are the second derivatives of 
Ox yay Ox ax2’ dy ayax’ ax axdy oy dy? 


F(x, y). Higher order partial derivatives can be defined in an analogous manner. The Clairaut's theorem states 


that 


Ox 


OF, — OF, 0 (<=) 0 (—) 
dy dx ay ; 


ax) ~ ax \dy 


In words, it means that for a function of several independent variables, the multiple partial derivatives can be 


carried out in any order. 


Example: Verify Clairaut’s theorem for the function F(x, y) =x? —xy? + y. 


Solution: In the previous example, we calculated < = 3x? —- y* and = = —2xy + 1.So now 
0 (=) eee 
ay \ax) ~~ 
0 (S) ee 
ax\dy) 


so the Clairaut’s theorem is indeed satisfied. 
Example: Check whether the Clairaut’s theorem for the function F(x, y) = exp(ax) sin(by) is satisfied. 


Solution: In this case, there are two independent variables x and y. So, first partial derivatives are 


OF (x, 

GEG) = aexp(ax) sin(by), 
Ox 

OF (x, 

eee) = exp(ax) bcos(by). 
dy 

The second derivative gives 
0 


— (=) = a exp(ax) bcos(by) 


a (5) = aexp(ax) b cos(by) 


that verifies Clairaut’s theorem. 


oF OF OF 
— represent the 


If F is a function of n independent variables, that is, F = F(x1,%2,X3, Xn), then —— omy? Be 
xn 


rates of variation of F with x1, X2,...X,, respectively. Since all n variables can pe independently, then 


(Change in F) = (Rate of change of F with x,) - (Change in x,) + (Rate of change of F with x2) 
- (Change in x.) + ::- + (Rate of change of F with x,) - (Change inx,), 


or, 
= OF OF 
(AF) = Oe aos ae +: ea — on) Eq. (1.8) 


(AF) is the total change in function F when all n variables are changed independently. It is called total 
differential. Ata point where F is extremum (namely, a minimum or a maximum), AF = 0. Since x1, Xz, ...X%, are 
independent variables, they may be chosen so that all but one of the Ax, in turn, are zero. It follows that 

OF 4 OF 4 OF ae OF ' Pee 

— = 0,— =0,...—— = 0 and —— = 0. o (il 

Ox, OX> Ona OXn EE) 
These n algebraic equations can be solved to find the values of variables x,, x2, ...X, that make the function F 


extremum. 


The idea of a total differential, introduced above, is very useful in estimating the largest possible error in the 
measurement of a physical quantity that is a function of several variables. An example will help in understanding 


the concept. 


Example: According to the Poiseuille’s equation, the total volume of blood flowing through a blood vessel of 


4 
radius R and length L per unit time is given by F = k ~, where constant k is independent of the geometry of 


the blood vessel. If the relative error, AR, in the measurement of the radius is 2% and the relative error, AL, in 
the measurement of length is 4%, then what is the largest possible relative error, AF, in the measurement of 


flux of blood, F, through this blood vessel? 


Solution: The relative error in the measurement of a quantity means error in measuring that quantity divided by 


A 
the actual value of that quantity. In other words, the ratios =, = and Fare, respectively, the relative errors in 


the measurements of the radius, length and flux of the blood through a blood vessel. These ratios are 
determined by first taking the natural log of the Poiseuille’s equation and then taking the derivatives of both 


sides as 


InF =Ink + 4lnR —-InL 


and 


Since error in the measurement of radius is 2%, it means that the value of = ranges between —0.02 and +0.02. 


Similarly, the value of = ranges between —0.04 and +0.04. Thus, the largest possible value of is 4(+0.02) — 


(—0.04) = 0.12. Or, the largest possible relative error in the measurement of flux of blood is 12%. 


Now, going back to the function F = F(x, X2, X3, ...X,) of n variables, if x1, x2, ...x, are constrained by one or 


more relationships of the form, 
D(X, Xo, Xp) = constant, 


then not all n variables are independent. In fact, if F is a function of n variables and if there are m (m < n) 
constraining relationships among variables, then the number of independent variables is only n — m. In this case 


one can use the method of Lagrange multipliers to find the extremum value of the function F’. 
Method of Lagrange Multipliers 


Instead of considering a general function of n variables, let us focus our attention on a function F(x, y, Z) of 


three variables, x, y and z. These variables are constrained by the relation 
(x, y,z) = constant. 


Because of this constraint only two out of three variables are independent. Let us choose x and y to be the 
independent variables. Since we wish to determine the extremum value of F(x, y, z), we take its derivative and 


set it equal to zero: 


—A —A — Az=0. Eq. (1.10 
- x + Fr y+ - Z q.( a) 


@ 
—A —A — Az=0. Eq.(1.10b 
= x + Fa Waa A Z q-( ) 


We multiply Eq. (1.10b) by a multiplier, —A, and add it to Eq. (1.10a) to get 


10 


OF O® OF O® OF O® 
Cee a a ae 
ly ly Oz Oz 


This choice of A will remove the term containing Az, leaving only the two independent variables x and y. As x 


and y are independent variables, we can choose them, in turn, so that Ax = 0 and Ay = 0 separately. It then 


follows that 
OF gece a 
ax Ox’ 
and 
OF Om = 
ay ~ ay 


The last three equations, along with the equation of constraint, provide the values of the multiplier A and the 


optimal values of the variables x, y and z that make the function F extremum. The constant A is called Lagrange’s 


undetermined multiplier. 


Example: A box (sides a, b,c) of fixed volume V is to be designed so that its surface area is minimum. Find the 


optimal values of a, b,c. 


Solution: 


Fixed volume, V 


Figure 1.3. A box of fixed volume V whose surface area is minimized. 


In this case, the equation of constraint is 


11 


V = abc = constant. 


Because of this constraint, only two sides of the box can be changed independently—the third side will be 


determined by the equation of constraint. We choose a and b as the independent variables. 
We need to find the minimum value of the surface area of this box, which is 
S =2(ab+bc+ca). 


Setting the derivative of S equal to zero, we get 


0s os Os 
AS = — Aa+ — Ab+— Ac =0, 


da Ob dc 

or, after dividing by the common factor of 2, 
(b+c)Aa+ (c+a)Ab+(a+b) Ac =0. Eq. (1.11a) 
Also, the volume V is constant. So, 
AV = heh be hoe 

0 Ob Oc 

or 
(bc) Aa + (ca) Ab + (ab) Ac = 0. Eq.(1.11b) 


Multiply Eq. (1.11b) by undetermined multiplier —A and add it to Eq. (1.11a) to get 


[(b +c) — A(bc)] Aa + [(c + a) — A(ca)] Ab + [(a+ b) —A(ab)] Ac=0. Eq. (1.11c) 


The multiplier A is still undetermined and can be chosen at will. We choose A = (a + b)/(ab) so that term 
containing Ac is removed in Eq. (1.11c). Also, since a and b are chosen as independent variables, it follows from 


Eq. (1.11c) that 


(b +c) —A(bc) = 0, 


and 


(c +a) —A(ca) = 0. 


From these three relationships, we get 


12 


A(abc) = a(b+c) = b(c +a) =c(at+b). 


Or 


a=b=c. 


Thus, a box of a fixed volume V and a minimum surface area is a cube. 


Example: Determine the ratio of radius, 7, and height, h, of a right circular cylinder of fixed volume V, that will 


make the surface area, S, of the cylinder a minimum. 


Solution: 


Fixed volume, V 


Figure 1.4. A cylinder of fixed volume V whose surface area is minimized. 


In this case, V = 2r*h = constant is the equation of constraint. Both r and A can vary, but only one of them is 
independent. Let us choose r to be the independent variable. The surface area, S, of the cylinder consists of two 


end circles and the curved surface of the cylinder. Thus, 


S = 2nrh + 2nr?. 


From the constraint condition 


ae Oar ea 
or oh , 
or 
2nrh Ar + mr? Ah = 0, 
or 


2h Ar+ rAh=0. Eq.(1.12a) 
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Also, when the surface area, S, is a minimum, then 


AS = 2 hed hh 
or Oh : 
or 
(27h + 4nr) Ar + 2nr Ah = 0, 
or 


(h + 2r) Ar+ rAh=0. Eq. (1.12b) 
On multiplying Eq. (1.12b) by undetermined multiplier —A and adding it to Eq. (1.12a) we get 
[2h —A(h+ 2r)] Ar + [r —Ar] Ah =0. Eq.(1.12c) 


The multiplier A, which can be chosen at will, is taken as A = 1. This choice of A removes the term containing Ah 


in Eq. (1.12c). Also, since r is an independent variable, we can choose Ar # 0 which makes 
2h —(h+2r) = 0, 
or 
h=2r. 


Thus, the right circular cylinder of a fixed volume V will have the least surface area when the height of the 


cylinder is equal to its diameter. 


Let us try to construct some common solids of different shapes, using playdough of a fixed volume V, such that 


the solid has a minimum surface area, S. 
If we construct a playdough cube of side length L, thenV = L? andS = 61? = 6V7/3, 


On the other hand, a playdough cylinder with a minimum surface area will have a height equal to its diameter. 
Thus, if R is the radius of this cylinder, then V = mR2(2R) = 27R?. The surface area is S = 2(mR7) + 
2/3 
(2nR)(2R) = 6nR? = 60 (=) = 5542/3, 
Finally, a playdough sphere of radius R will have its volume as V = (=) R? and its surface area as S = 470 R? = 


An a = 4.84 2/3, 
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Thus, among these common solids, all of the same volume V, the sphere will have the smallest surface area. This 
may partly explain why all the celestial bodies — stars, planets, and moons — are spherical in shape. It also 
explains why soap bubbles and raindrops tend to be spheres. In other words, Mother Nature prefers to make 


shapes with least surface area, for a fixed volume of its own playdough. 


Chain Rule for Partial Derivatives 


Recall that chain rule of calculus is applicable when we need to take derivative of a function of another function. 


Case I: The function F (x) is a function of a single variables x which itself is a function of two other variables, s 
and t, that is, x(s,t). Then, in principle, F is a function of two variables, s and t. The derivatives of F with respect 


tos andt are 


OF _ dF 0x 

ds dx 0s’ 
and 

OF _ dF ox 

Ot dx Ot’ 


Example: Given the function F(x) = x? with x(s,t) = s + t, determine the derivatives < and ~ 


Solution: 


TG 1)=2 
Se = (2x)(1) = 2(s +1) 


and 


us 2x)(1) = 2(s+t 
5p = (2x)() = US +0). 
Alternatively, 
F(s,t) =(s +t)? 


and, directly, the partial derivatives of F(s, t) with respect to s or t provide the same results as those obtained 


by using the chain rule for partial derivatives. 


Case Il: F(x, y) is a function of two variables x and y, and x and y themselves are functions of a single variable t. 


Then, in principle, F is a function of a single variable t and the derivative of F with respect to t is, 
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dF OF OF dy 
dt dxdt dy dt 


Example: Given the function F(x, y) = In(2x + 3y) withx = t* and y = sin(3t), determine the derivative 


dF 


dt* 
Solution: Using the chain rule for partial derivatives, 


dF OF is OF dy _ 2 Set i: 3 3 3)] = 4t + 9cos(3t) 
den de” Oy dt 2eray OO pay Jee asneD 


Alternatively, we could write 
F(x, y) = In[2t? + 3 sin@t)] 
and then use the rule, for taking derivative of a function of a single variable, to get the same result. 


Case Ill: The function F(x, y) is a function of two variables x and y, and the two variables x and y themselves are 
functions of two other variables, s and t; that is, x(s,t) and y(s, t). Then, in principle, F is a function of two 


variables, s and t. The derivatives of F with respect to s and t are 


OF OF Oe 08 oy 
ds dx ds dy as’ 


and 


OF OF Ox OF Oy 


Ot dx dt dy Ht 
Example: Given the function F(x, y) = x7 + xy + y* with x(s,t) = s+tand y(s,t) = st, determine the 


si aF oF 
derivatives — and —. 
Os ot 

Solution: 


Using the chain rule for partial derivatives, 


OF OF 0x _ OF dy 


ae ao ae Dy as ee eR ee ee SOS eer ese 


and 
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OF OF 0x _ OF dy 


ae ae ag Bye ee Oo ERS Reese): 


Alternatively, one could express F(x, y) as F(s,t) in the form 
F(s,t) =(s +t)? +(s+ost + s?t?. 


Taking the partial derivatives of F(s,t) with respect to s or t will lead to the same results as those obtained by 


using the chain rule for partial derivatives. 
1.3 WAVE EQUATION 


As an application of partial derivatives, we will derive an equation, the wave equation, which describes a periodic 
function. A function is periodic when it repeats itself either in time or in space. Periodic phenomena are very 
common in a variety of fields including biology, physics, chemistry, astronomy, and so on. In biology, periodic 
phenomena include cell cycle, cardiac cycle, circadian rhythms, ovarian cycle, and metabolic cycle. In physics, 
examples of periodic phenomena include a swinging pendulum, mass on a spring, and water waves (or ripples). 
In chemistry, periodicity is found in the properties of chemical elements and in oscillating chemical reactions. In 
astronomy, there are abundant examples of periodic phenomena such as motion of satellites and the Moon 
around the Earth, motion of planets around the Sun, and the associated periodic occurrences of seasons, tides, 


and the day-and-night cycle. 


One of the simplest mathematical functions that repeats itself is the sine (or cosine) function. For example, 


sin(kx) is a periodic function that repeats itself in spatial coordinate x. In fact, the value of this function at some 


point Xq is same as its value at Xp + = In other words, the wavelength [or, the length over which the function 
repeats itself] of this function is A = =. Similarly, sin(wt) is a periodic function that repeats itself in time t. The 
value of this function at some time fy is the same as its value at ty + = In other words, the period [or, the time 
over which the function repeats itself] of this function is T = = Note that - measures the number of times the 
function repeats itself in a unit time. Therefore, =, that measures how frequently the function repeats itself, is 


called the frequency, f, of the wave. Thus, f = - = 
Now, a wave, according to its dictionary meaning, refers to “a disturbance on the surface of a liquid body, as the 
sea or a lake.” We can set up a water wave (commonly called a ripple) by throwing a large stone in a lake. This 
wave will look like a series of surges which are progressively moving outwardly away from the point where stone 
touched the water. If we throw a bottle cork in the disturbed water, we will observe that the cork will be simply 


bobbing up and down at its fixed location, without moving horizontally with the wave, as surges pass by it. It 
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indicates that, in case of ripples in water, the wave is moving outward while the water itself is not moving with 
the wave. We can investigate the behavior of this wave either as a function of spatial coordinate x or asa 
function of time t. Using a camera, if we take the photograph of this wave, it will look like Figure 1.5 which 
shows, at a fixed time [the time at which photograph was taken], the wave as a function of x. On the other hand, 
we could focus our attention at some fixed point, say the bobbing cork with spatial coordinate x, and measure its 
displacement with respect to the horizontal level of calm water as a function of time t. Figure 1.6 shows this 
displacement, for a fixed value of x [location of cork], as a function of time t. From Figures 1.5 and 1.6 we note 
that this wave is periodic in both x and t. If we represent this wave mathematically by a sine function, we get a 


sinusoidal or a harmonic wave. 


At constant time, f 


Point P 


Position x 


Dsiplacement of water 


Figure 1.5. Periodic function sin(kx — wt) as a function of x for a constant value of t. 


We note, in passing, that because of the identity 
_ ft 
sin (5 + 0) = cos(@), 


for a given angle @, the sine function and the cosine function look the same except the sine function is ahead of 
cosine function by a phase of 7/2. In general, if a function looks like a sine or a cosine function, we will refer to it 


as a sinusoidal function. 
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At the same position, x 


Time t 


Dsiplacement of water 


Figure 1.6. Periodic function sin(kx — wt) as a function of t for a constant value of x. 


The mathematical function 


F(x,t) = Asin(kx — wt) Eq. (1.13a) 


represents the sinusoidal wave of figures 1.5 and 1.6. The function F(x, t) represents the displacement, at time 


t, of a point in water located at position x. The largest value of the displacement is A, which is called the 


amplitude of the wave. The wavenumber k is related to the wavelength J as k = = The angular frequency w is 


related to the period T as w = =. The argument of the sine function, namely (kx — wt), is called the phase of 


the wave. Each point on the wave, such as point P in Figure 1.5, has a fixed constant value of phase which does 
not change as the point P moves along with the wave. From Eq. (1.13a), as t increases, x also must increase to 


keep the phase constant. Thus, this wave travels along the + x direction. Similarly, a wave of the form 
F(x,t) = Asin(kx + wt) Eq.(1.13b) 
travels along the - x direction. 
The phase velocity refers to the speed of an arbitrary point, like P of some fixed phase, on the wave. For point P, 
phase = kx — wt = constant. 
Thus, 


— _ 
a 


19 


or 


w 
— = Uh Eq. (1.14) 
k 
Here v is called the phase velocity of the wave. Thus, 
F, (x,t) = Asin(kx — wt) 
and 
F_(x,t) = Asin(kx + wt) 


are the sinusoidal (or harmonic) waves travelling along +x and - x direction, respectively. These waves are 


moving with a (phase) velocity of 


Using partial derivatives of 
F(x,t) = Asin(kx — wt) 


we get 


on ih k. t 
ax cos(kx — wt) 


0?F ia i 5 
age —k*Asin(kx — wt) = —k*F 
OF 
reais WA cos(kx — wt) 
0°F er - 
a2 = —w*Asin(kx — wt) = —w*F 


Combining these equations, we get 


Finally, using w = kv, where v is the phase velocity, we have 
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This is known as the wave equation. Equations of this kind, which relate various partial derivatives, are called 


partial differential equations. 


In the above discussion, we derived the wave equation by starting from sinusoidal (or, sine- and cosine-like) 
functions. However, the general solutions of the wave equation are not necessarily sinusoidal. We will now show 
that general solutions of the wave equation are not merely functions of variables x and t, but are functions of 
the combinations x + vt and x — vt. To show this, we make a change of variables from x andt tor =x+vt 


and s = x — vt. Using chain rule for partial derivatives, 


SLL LL 
ax dr dx Os Ox Or ds \Ar' ds)’ 


OF _ OF or | OF @s_ OF OF eo a 
Wi Gee os Be er 


Or, in operator notation, 


Then, 


oro (S)=(2 \iG oe. O*F 0°F 
ax? ax \dx. ar Or ds) dr? “drds ° as?’ 


af ae 
ore (=) = (: =) (a 2") - ; 0°*F gee 
ate at\at) \ar as) Nar as)” \ar? “aras | as2}" 


Thus, the wave equation becomes, 


O*F 1 OF 00°F _ 
0x2 v20t2 ards 


It implies that 
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as well as 


0 ($) Ls 
as \ar) 
In words, it implies that < is independent of r and < is independent of s. Thus, the function F can be expressed 


as a sum of two separate functions, g and h, such that g is a function of variable r only and h is a function of 


variable s only. So, a general solution of the wave equation is of the form, 
F=g(r)+h(s), 
or 
F(x,t) = g(x+vt)+h(x — vt). 


We note in passing that the two sinusoidal functions that we encountered previously are indeed functions of 


variables r = x + vt ands = x — vt. Explicitly, using w = kv, 


F,(x,t) = Asin(kx — wt) = Asink(x — vt), Eq.(1.16a) 


and 


F_(x,t) = Asin(kx + wt) = Asink(x + vt). Eq. (1.16b) 


1.4 IMPLICIT DERIVATIVES 


Consider the relationship y* + sin x sin y = x between two variables x and y. Both x and y can vary, though not 


independently since a change in x leads to a change in y and vice versa. Thus, either y can be treated as a 


function of a single variable x, or x can be treated as a function of the variable y. We can determine oa the rate 


of change of y with x, or = the rate of change of x with y. We note that it is not easy to write either x as a 


function of y or y as a function of x. Starting with the relationship between x and y, we first differentiate it with 


respect to x to get 
jy 4 iny + si ery 
yz + cosxsiny +sinxcosy — = 1, 


or 


d 
(2y + sinx cos y) = 1—cosxsiny, 
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or 


dy  1-cosxsiny 
dx 2y+sinxcosy’ 


Again, starting with the relationship between x and y, we next differentiate it with respect to y to get 


aay 4 Ges _ dx 
parce siny 5 eccosy =a 
or 
dx 
(1 -—cosxsiny) —— = 2y +sinx cosy, 
dy 
or 


dx 2y+sinx cosy 
dy 1 —cosxsiny ° 


These kinds of derivatives, which contain both variables x and y, are called implicit derivatives. Note in passing, 


in this example, 


wee z Eq. (1.17 
Ame GBs q. (1.17) 
dy 


which is a general property of derivatives of a function of a single variable. 


As another example, consider the relationship x + y = exp(xy) between variables x and y. We first 


differentiate this relationship with respect to x to get 
dy dy 
1+ i exp(xy) {x= + y}, 
or 
dy 
{x exp(xy) — 1} 5 = 1-yexp(xy), 


or 


dy _1—yexp(xy) 
dx xexp(xy)-1 
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Next, differentiate the original relationship with respect to y to get 


dx oe % dx 
ag = exp(xy) {x ae 


or 
dx 
{1 -— yexp(xy)} dy = xexp(xy) - 1, 


or 


dx _xexp(xy)—-1 
dy 1—yexp(xy) 


Again, the reciprocity relationship of Eq. (1.17), namely, 


dy 1 
dx dx’ 
dy 


is satisfied since we are dealing with a function of a single variable here. Now, let us explore whether this 
reciprocity relationship also works for partial derivatives. As an example, consider the coordinates of a point ina 
plane. The Cartesian (x, y) and the plane polar (p, @) coordinates of the point are related to each other. The x 


and y coordinates can be expressed as functions of variables p and @, 


x=pcosd¢d, y=psing. 


Conversely, the p and ¢ coordinates can be considered as functions of variables x and y, 


p= jx? +y4,@= tan? (7). 


x 


From here, 
y 
Ob y 
Ox yy? x2 4 y2’ 
es *) 
and 


aa —psing = —y. 
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Clearly, <e # = The reason is that the variables being held fixed are different for each of the two cases. In the 
op 


above example <e is actually (*) , that is, variable y is held fixed while evaluating this derivative. Similarly, = is 
y 


(=) since variable p is fixed during evaluation of this derivative. 


Legendre Transformation 


Suppose F(x, y) is a function of two independent variables x and y. It is possible to define two new variables u 
and v, which are combinations of x and y [for example, u = x + yandv = x — y], and, inversely, x and y are 
functions of variables wu and v. Now if x and y in F are replaced by u and v, then F will become a function of two 


new variables wu and v. Thus, 


F(x,y) > F[x(u, v), y(u, v)] = G(u, v). 


The procedure for replacing one set of independent variables in a function by another set of independent 
variables is accomplished, in general, by Legendre transformation. The change of independent variables is helpful 
in classical mechanics in discussion of canonically conjugate variables, and in thermodynamics in the discussion 


of Maxwell relations. The transformation procedure is described below. 
For F(x, y), we can write 


ape oe eae, =pdx+qd 
~ ae dy y=upaxrqay, 


where p(x, y) = < and q(x,y) = 7 Now, define three new functions, 
f =F@y)— py) x, 
g=Fay)-—a@%y)y, 


and h= F(x,y) — p(x, y)x — q(x, y)y. 


Then, 
df =dF —pdx—xdp=-xdp+qdy, 


dg =dF —qdy—ydq =pdx —ydq, 


dh = dF —pdx —xdp—qdy—ydq =—xdp—y dq. 
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Thus, f is a function of variables p and y, g is a function of variables x and q, and h is a function of variables p 


and q. The partial derivatives of f(p, vy), g(x, q) and h(p,q) are 


Of(py) _ f(y) _ 
———St SS , ya, © Oe + , 
Op dy 
0g(x%,q) _ 0g(x,q) _ 
= +p , = ae 
Ox 0q 
Oh(p,q) _ dh(p,q) _ 
ap , aq. 
Finally, using the Clairaut theorem, 
eS oe) Eq. (1.18a) 
=| )) =4+(=} , .(1.18a 
dy, Op! , | 
dp dy 
ar ea = alae) Eq. (1.18b) 
alae Eq. (1.180) 
=o (ieee |= ee ee .(1.18¢ 
dq’, Op! , z 


These three relations along with the original relation 


0 0 
ey Seas Fa. (1.184) 
dy! dx) y 


are the basis of Maxwell’s relations in thermodynamics. Appendix B describes an easy mnemonic device to 


remember Maxwells relations with correct signs of terms. 


PROBLEMS FOR CHAPTER 1 


1. Given F = exp(x) siny, x = uv?,y = uv, use the chain rule to find 


Write the result in terms of variables x and y only. 
2. Consider a function of a single variable x, F(x) = x? — 6x” + 9x + 4. 
(a) Determine the values of x at which the function F(x) is an extremum. 
(b) At the extremum points, does the function F(x) have a minimum or a maximum value? 


3. Biomedical Physics Application. A common inhabitant of human intestines is the bacterium Escherichia coli. A 
single cell of this bacterium in a nutrient-broth medium divides into two cells every twenty minutes. The initial 


population of a culture is 250 cells. 
(a) Find an expression for the number of cells after t hours. 
(b) Find the number of cells after 5 hours. 
(c) When will the population reach 128,000 cells? 


4. Biomedical Physics Application. At 8:00 AM in the morning, a biologist starts a six-hour experiment with 
bacterium Escherichia coli. As mentioned in problem 3, a cell of this bacterium in a nutrient-broth medium 
divides into two cells every twenty minutes. At 12:00 PM the biologist measures the cell population to be 


2,048,000. 
(a) What was the initial population of the bacterium cells in the beginning of the experiment at 8:00 AM? 
(b) What will be the final population of the bacterium cells at the end of the experiment at 2:00 PM? 

5.1f V(r,0) = (ar” + Br-™) cos(n@) where a, B and n are constants, find 


av av av 
Or’ Or? BEF 


Hence show that 
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—+-—+s5=5= 
Or? ror r?* 002 
6. The pressure P, volume V. and temperature T of a certain gas are related by the van der Waals’ equation of 


state, 
(P+ =z) V5) = RT, 


2 
where a, b and R are all constants. Find the values of P,V and T for which (=) =0 and (=) = Oare 
T T 


satisfied simultaneously. These particular values of pressure, volume and temperature are called critical values 


(P., V. and T,.). One can define reduced parameters as 


Show that the van der Waals’ equation can be recast, in terms of the reduced parameters, in the following 


invariant form: 


7. Biomedical Physics Application. In all mammalians, including human beings, the rate of growth of skull is 
known to be different from the rate of growth of backbone. The allometric relationship between skull size S(t) 


and backbone length B(t), at age t, is 
S(t) = aB(t)?, 
where a = 1.16 and b = 0.93. 


1 dB 


(a) Determine the relationship between the relative growth rates, - — and ee of skull and backbone, 


respectively. 


(b) Which part of the body, skull or backbone, grows faster than the other? 


2 


aa? , 
+ aaz IS called the Laplacian operator. 


+ a2 


a2 
ax? 


8. If f(x, y,z) = 1/(x? + y? + 2”), calculate V2, where V2 = 


9. For f(x,y) = x3 —y? — 2xy + 6, find the values of 07 f/x? and 07f /dy? at the points where = 


0 as well as OF a 0. 
dy 


28 


10. Show explicitly that arbitrary functions F(x — vt) and F(x + vt) are solutions of the wave equation 


0*F «61 0°F 


ax? v2 at? 
11. Biomedical Physics Application. All human beings belong to one of the four main blood groups (types of 
blood) — A, B, AB and O. Blood group O is the most common type of blood in the world. Mixing blood groups can 
lead to a life-threatening situation. The blood type of a person is determined by three alleles (a variant form of a 
gene), A, B and O that are inherited from parents, one from father and other from mother. The inherited genes 
join to form the blood groups A (joining A with A or A with O), B (joining B with B or B with O), AB (joining A with 
B) and O (joining O with O). According to the Hardy-Weinberg Law of genetics, the fraction of population that 


carries two different alleles is 
P=2xy + 2yz + 2zx 


where x, y and z are the fractions of alleles A, B and O in the population. Using the fact that x + y+z=1, 


show using the Method of Lagrange Multipliers that P can be at most 2/3. 


12. Assume that a and Db are the side lengths of a right-angle triangle. The size of the hypotenuse, h, of this 
triangle is fixed at 5V2 m. Using the Method of Lagrange Multipliers, determine the lengths of the sides a and b 


when the area of the triangle is extremum. 


13. Biomedical Physics Application. The drug response function R(t), describing the level of medication in the 


bloodstream after a drug is administered, can be represented as 
R(t) = R, t~3/? exp(—a/t ) 


where Ry and a depend on the nature of the drug. For a particular drug Rp = 0.01 anda = 0.138, and t is 
measured in minutes. Determine the time, t, at which the level of the medication in the bloodstream is 


maximum. 


14. Biomedical Physics Application. In the angioplasty procedure, a “balloon” is inflated inside a partially 
clogged artery to restore the normal blood flow. The volume of blood flowing per unit time past a given point, 


F,, is proportional to the fourth power of the radius, R, of the artery carrying the blood (Poiseuille’s equation), 
F=k R*. 
What will be the relative change in F (that is, “) when an artery is constricted by a 2% change in its radius due 


to clogging? 
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Chapter 2: Integral Calculus 


In this chapter we will first review integral calculus, assuming that the reader has already seen integrals of 


functions of a single variable. After a short review, we will introduce the multiple integrals. 
2.1 INDEFINITE INTEGRALS 


An indefinite integral is defined as an antiderivative in the following sense. If a known function, F(x), is 


represented as the derivative of an unknown function, f(x), that is, F(x) = r then 


[ Fe dx = f(x). 


The function f (x) is the antiderivative, or integral, of F(x). In this case x is the variable of integration and 


the known function F(x) is the integrand. Since < = Oif C is a constant, it is customary to write 


| Fe dx = f(x)+ C, Eq. (2.1) 


where C is called the constant of integration. Some well-known indefinite integrals are 


[ ewe dx =exp(x)+ C, 
1 
[ear=inx+c, 
x 
J sinx dx =-—cosx+C, 
[ cos dx=sinx+C, 
| sectx dx =tanx+C, 


1 
| qe t= arctan x + G-; 
1+ x? 
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dx =arcsinx+ C. 


1 
Iz — x? 
The product rule of differential calculus [see Eq. (1.3)] corresponds to integration by parts in integral calculus. 


In differential calculus, if F(x) = f(x) g(x), then 


a uh - alfgGe] 


== F(x) g(x) = ——= 


Using the definition of an indefinite integral as an antiderivative, it can be written as 


d d 
[1c a+ [om ae = pogo. 


On rewriting this equation as 


df (x) 
[re ax= poe - [Tome — F4.220) 
we get the rule for integration by parts, which can be written in verbose form as 


| [FIRST][SECOND] dx 


= [FIRST][Integral of SECOND] — | Derivative of FIRST\[Integral of SECOND] dx. 


Whenever the integrand can be expressed as a product of two functions, FIRST and SECOND, it is convenient 
to identify as FIRST the function whose derivatives are comparatively simpler than the function itself. Similarly, 


it is convenient to identify as SECOND that function whose integrals are simpler than the function itself. 


An alternate way of writing the integration by parts procedure is 


[ F@ age = Feoaeo - [ 9@ afeo. Eq. (2.2b) 


Example: Evaluate J = f x? exp x dx using integration by parts. 


Solution: In this case, the integrand is a product of two functions, x? and exp x. Since x? becomes simpler on 
differentiation while exp x stays the same on differentiation or integration, we choose FIRST as x? and 


SECOND as exp x. Then, 


T= x? expx— | (2x) expx dx. 
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Now, I is converted into a simpler integral, which can be evaluated using integration by parts one more time, 
I= x*expx—2 [x expx— i exp x dx| = x*expx — 2x expx+ 2expx. 


Finally, after including the constant of integration, the integral is evaluated as 
I= expx(x?-—2x+ 2)4+ C. 
Example: Evaluate J = f x? In x dx using integration by parts. 


Solution: In this case, In x becomes simpler, compared to x2, on taking derivatives. So, we choose FIRST as Inx 


and SECOND as x2. Then, 


After including the constant of integration, the integral is evaluated as 
x3 
aay [3Inx -—1]+C. 


2.2 DEFINITE INTEGRALS 


In a definite integral the range of the values of x, the variable of integration, is provided. If the rangeisa <x < 


b, then 


b 
| Feo dx = FOOLER = FO) - FO, Eq.(23) 


where F(x) = f Thus, we first determine the function f (x) which is antiderivative is F(x). Then, the value of 


the definite integral is the difference between the values of f(x) at the upper limit and at the lower limit. 


There is an alternate way of interpreting a definite integral. If F(x) is a continuous function in the whole range of 
x,a <x <b,as shown in Figure 2.1, then we can divide the range into n equal intervals, each of width Ax = 
(b — a)/n. Also, if x;, for i = 1,2, ...n is the location of the midpoint of the ith interval, then the definite integral 


is defined as a limit of a sum as follows: 
b 


n 
[ Fe dx > lim > F(x,)Ax . 
n- oo 
i=1 


a 
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Figure 2.1. Area under the F(x) versus x curve is broken into strips. 


In Figure 2.1 the area under the F(x) versus x curve is broken into n strips, each of width Ax. Note that F(x; )Ax 


is the area of the cross-hatched ith strip shown in the figure. If the number of strips n > 0, then the width of 


each individual strip becomes vanishingly small and the sum of areas of all strips becomes equal to the area 


under the F(x) versus x curve. The definite integral it F(x) dx thus represents the area under the F(x) curve 


fromx =atox=b. 


Note that fi F(x) dx is anumber and its value does not depend on x. In fact, 


b b b 


[ Fe dx = [ Fo dy = | Fede, 


a a a 
Thus, in a definite integral, the symbol for variable of integration, x or y or z, is only a placeholder and it 


disappears after the integral is evaluated. So, in a definite integral the variable of integration is called the 


dummy variable. From the definition of a definite integral, it follows that 


b a 


[ Feo a = fF) - F@ =-F@ - FOI =- | FEO ax 


a b 
and 


a 


[ Foo ax =o. 


a 
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a c yp 


Figure 2.2. The point x = c lies inside the range,a < x < D, of the integrand. 


If the point x = c lies somewhere in the middle of the range of x, then 


b 
| F(x) dx = area under the curve from a to b 
a 
= area under the curve from a to c + area under the curve from c to b 
Cc b 


[ Fe ax+ | Fo) dx. 


a Cc 


Also, if F(x) = G(x) fora <x <b, then 


b b 


[ Fe dx = ie dx. 


a a 
Similarly, if F(x) < G(x) fora <x <b, then 


b b 


[ Fe dx < [ ee dx. 


a a 
If F(x) is positive for part of a range and negative for the remaining range of x, as shown in Figure 2.3, then 


b c b 


| Feo a= [ Foo aes | r@ar= +h, 


a a Cc 
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where I, > 0 and J, < 0. Note that even in this case the net area under the curve of a function F(x) between 


x =aandx = bis ie F(x) dx. However, the magnitude of the area under the curve of F(x) is fllF@Oldx. 


F(x) 


b 


Figure 2.3. The function F(x) is positive as well as negative for parts of the range of x. 


a 8g Xx 


Example: Find the net area and the magnitude of the area enclosed by function F(x) = x7 -5x+6= 


(x — 3)(x — 2) between x = 0 and x = 3 and the x-axis. 


Solution: In this case F(x) = 0 for0 <x < 2, and F(x) < 0 for 2 <x < 3. The net area is 


3 2 


0 0 x=0 


3 3 
x? 5x2 a 9 
[ Foo dx = | 2 -5x +6) ax = —-—+6x = SS 


To find the magnitude of the area under the curve, we break the integral from 0 to 2 and from 2 to 3. Then, 


2 


2 
xe 5x2 
= [@?-5x46)dx= zp te 


0 
and 


3 3 


xe by 
p= [@?-5x +6) dx = >-—+6x 


3 2 


2 2 


Clearly, , +1, = = = 7 = =, same as the net area, but the magnitude of the area is 


uleteye 4-4 e2? 
thle F|+ bala 
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2.3 DIFFERENTIATION OF INTEGRALS: LEIBNIZ’S RULE 


If one or both limits of a definite integral are functions of a variable, x, or if the integrand contains x as a 
parameter, then it is possible to differentiate the integral with respect to x. The example of such a definite 


integral is 


v(x) 
I(x) = | F(x,t) dt, 


u(x) 


where u(x) and v(x) are functions of x and t is the dummy variable. The derivative < is evaluated using a 
method developed by Leibniz. To understand Leibniz’s method of differentiating I(x), we first look at three 
simpler cases of this integral, namely, I, (x), I(x), and I(x). In integral I, (x), the variable x appears only in the 
upper limit; in integral I, (x), the variable x appears only in the lower limit; and in integral J, (x), the variable x 


appears only in the integrand. If 


x 


1G) = [ F@ at= fo) -f@, 


then 
dl, df 
———) ae 
dx dx () 
On generalizing, if 
v(x) 


na) =| F@adt= fed) -f@, 


a 


then 
dl, df(v(x))__ df(v(x)) dv(x) _ dv(x) 
dx dx dv dx FU)) dx ° 
Next, if 
b 
x) = | F@at= fO)- FO, 
then 


36 


Again, on generalizing, if 


I(x) = | F(t) dt = f(b) - fu), 


u(x) 


then 
dl, _ df (u(x)) _ df (u(x)) du(x) | 7 du(x) 
dx dx au dx (u@)) dx 
Finally, if 
b 
I3(x) = [ Fe t) dt, 
then 


b 
dl, OF (x,t) 
dx | Ox ae 


These three results of integrals I, (x), I(x), and I(x) can be combined and written in a single form, known as 


Leibniz’s rule, 


v(x) v(x) 

dl(x)_d | F(x,t) dt \ = F( ye F yee i OF Gt) 5 Fa. (24 

ne ae O30) = F(x, vx) = — Fix ue)) = I ax q. (2.4) 
u(x) u(x 


Example: Determine a if 


Solution: Using Leibniz’s rule, we have 


di(x) __ exp[x(2x)] _ exp[x(x)] 2x t exp(xt) 
dx 2x (2) x (1) + is t dt, 


or 


t=2x 


dI(x) _ exp[2x*] _ exp[x*] x exp(xt) 
dx x x x feo 


= (exp[2x?] — exp[x?]). 
2.4 MULTIPLE INTEGRALS 


Multiple integrals are generalizations of single integrals with one variable. For example, a double integral, I,, 


with two variables, x and y, can be perceived as two single integrals. 


x=by y=b2 x=by 
| | F(x, y)dy dx = | f(x)dx, Eq. (2.5) 
x=a, y=az xX=Q4 
where 
y=b2 
foo= | Fexdy. 
y=a2 


In evaluating the last integral for f(x), the variable x inside the integrand is treated as a constant. Similarly, a 


triple integral, Jz, with three variables, x, y, and z, can be perceived as three single integrals. 


x=b, y=b2 Z=b3 


n= || | | F(x, yaeavas= [das Eq. (2.6) 


X=a, y=a2 z=a3 


where 


b2 b3 
f(x) = | g(x,y)dy and g(x,y) = | F (x,y, z)dz. 


Again, in evaluating the integral g(x, y), the variables x and y inside the integrand are treated as constants and 
in evaluating the integral for f(x), the variable x inside its integrand is treated as a constant. Finally, if F(x, y, z) 
is a separable function, it means that F can be expressed as a product of three separate functions with each 


function containing only one variable as F(x, y,z) = h,(x)h2(y)h3(z). So, for a separable integrand F(x, y, z), 


by bo bs 
p= [ yGodx | ho dy | hy @ae 
ay az a3 


is also separable. 


Example: Evaluate the double integral So i,@ — y)*dy dx. 


38 


Solution: Consider the following integral, 


11 140 
| [@-vrayax = [Je —2xy + y") dydx 
0 0 0 0 


7 2 3yyat . 
=| eye dx = | (x?-x+5) dx 
0 y=0 0 
x3 x? xf? 1 
“le 2 3]. 8 


2.5 BAG OF TRICKS 


In this section we will introduce some tricks for evaluating somewhat complicated integrals by starting from a bit 
simpler integral which is commonly known. First, we look at some examples of indefinite integrals and follow it 


up by definite integrals (exponential and Gaussian integrals), which are useful in several areas of physics. 


Trick Number 1: Sometimes a complicated integral, which may appear quite intractable, can be manipulated so 
that it is written as the derivative of a simpler basic integral. In fact, it is possible to evaluate several new 
integrals by introducing a temporary parameter in the basic integral, and then differentiating with respect to this 


parameter to get the new integral. 


Trick Number 1a: As the first example, indefinite integrals of the type 


dx dx Be OF 
i> | oe 4274) 


are evaluated by starting from the basic integral se = In(x). We introduce a parameter a in the basic integral 


by changing x tox + aas 


[< = In(x + Eq. (2.7b 
cag 7 ne a). q.(2.7b) 


After having introduced the parameter a, we can now differentiate the integral with respect to a to evaluate the 


required integrals. 


dx 


Example: Evaluate the indefinite integral [ Gas” 
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Solution: Differentiate Eq. (2.7b) twice with respect to parameter a to get (x + a)? in the denominator. The first 


differentiation gives 


i dx _ 1 
(xt+ta)j? xt+a 
The second differentiation leads to 


5 dx 1 
“| Sar aaa 


which simplifies to the required integral 


dx | 1 
<=> ~  2(x+a)?’ 


Trick Number 1b: Next, we can evaluate integrals of the type 


1 1 
la = Re ax, [ (a2 — x2)5/2 aX, ... Eq. (2.8a) 


by starting with the basic integral f z = dx = arcsin x. On replacing x by x/a in the basic integral, we get the 
x 


Ji-xt 


integral with parameter a, 


1 _ fee 
= dx = arcsin ie Eq. (2.8b) 


By differentiating this integral with respect to parameter a, we can evaluate integrals appearing in Eq. (2.8a). 


Next, there are similar indefinite integrals of the type, 


x x 
le = 2) 52 ax, | (a2 — x2)5/2 dx, ... Eq.(2.9a) 


In this case, the basic integral is 


1-— x?, 


x 
a 
Gor 


which can be evaluated easily using substitution u2 = 1 — x7. On replacing x by x/a in the basic integral, where 


ais a parameter, we get 


DG 
a een 
[> dx a Gee Eq.(2.9b) 
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Differentiating this integral with respect to parameter a leads to evaluation of integrals of Eq. (2.9a). 


ree 2 
“ dx as well as the definite integral (i ——_y, dx. 


Example: Evaluate the indefinite integral [ @ (ay? 


- 2)" 


Solution: Differentiating Eq. (2.9b) with respect to a gives 


| 1 2a ie 1 2a 
aera aaa #4 =-7— 


or 


x 1 
lear’ TSS 


The definite integral is obtained by substituting the upper and the lower limits, 


a/2 


x 
dx = 
| aaa V3a a 


Trick Number 1c: Several indefinite integrals of the type 


1 1 
—— dx, | ———— ax.,... Eq. (2.10 
le + a2)? 25 la + a2)3 x q-( a) 


can be evaluated by starting from the basic integral qi ere dx = arctan x. Introduce a parameter a by replacing 


1: 
x24 
x by x/a in the basic integral to get 
i 1 x 
| Foe dx = Foca (=) i Eq. (2.10b) 


Now, multiple differentiations of this integral with respect to parameter a lead to the required integrals of Eq. 


(2.10a). 


Example: Using the integral i —zdx = = as a guide, introduce a parameter and then differentiate with 


respect to this parameter to evaluate the integral 


ane | 
| oa 


dx 
1+x2 


Solution: In the guiding integral i = zi introduce a parameter y by changing x to x/y, 
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[ 1 dx [ dx 1 
ea Py rer ee 
0 4 + (=) y 2 o x“+y 2y 


Now, to get (x? + y)? in the dominator, we differentiate the integral with respect to parameter y twice. The 


first differentiation gives 


[ bosle-absh 


or 


m1 
j (x? + y?)2 4y® 


The second differentiation gives 


[ basale-il sl: 


or 


3m 1 
j (x? + y?)? 16 y*" 


which is the value of the required integral. 


Next, somewhat similar integrals of the type 


x x 
i CIE ax, | (pp Eq. (2.114) 


can be evaluated by starting from the basic integral 


x 1 . 
[a dx = 5 in + 1). 


This integral is easily evaluated by using the substitution u = x? + 1. On replacing x by x/a, where ais a 


parameter, we get 


| nea Eq.(2.11b 
omarae x= 5In(x a na. q.(2.11b) 


Differentiating this integral with respect to parameter a provides the integrals of Eq. (2.11a). 


Trick Number 2: Continuing with our bag of tricks to solve indefinite integrals, let us evaluate integrals of the 


form fi ax, where P(x) and Q(x) are polynomials of x. 


Trick Number 2a: Our first case is the one in which degree of polynomial P(x) is less than the degree of Q(x). 


Examples of such integrals are 


dx x dx 
(ee eee 
x2 -—x-6 x2—7x+12 


If the denominator can be factored, then use partial fractions to break the integrand of the first integral as 


1 = 1 7 A a B 
x?—-x-6 (x—-3)(x+2) x-3 x42 


To determine A and B, multiply both sides by (x? — x — 6) to get 


1= A(x +2) + B(x — 3). 


Since this is true for any values of x, set x = 3 and x = —2 successively, to get A = = and 
B=--. Thus, 
5 
1 ees ae 1 
aoycerls aI 


The original integral then becomes 


dx 1 dx 1 dx 1 al 1 x-3 
[= -:/4-:/ = gIn(x = 3) — gIn(x + 2) = gIn( ) 


x2—x-6 5) x-3 SJ xt+2 x+2 
Similarly, for the second integral, 


x 7 x * A - B 
x2—7x+12 (x-4)(x-3) x-4 x-3’ 


or 
x = A(x -—3)+ B(x -4). 


On setting x = 3 and x = 4 separately, we get A = 4 and B = —3. So, 


—— = — = 4] 4) —31 3 
Paiehi2 ee page 


Trick Number 2b: There are situations in which the denominator of the integrand cannot be easily factored. In 
this case, one can try to complete the square and make a substitution that will turn the integral into a simpler 
well-known integral. For example, 
| dx _ i dx 
3x2-6x+7 J 3(x—1)?+4 
Define a new variable, u = V3(x — 1)/2, or dx = 2 du/V3. With this substitution, 


i dx 1 pe (x - | 


a ae 33 — a: i~ 7 aes = qarcan 


Trick Number 2c: \f the numerator contains a polynomial P(x) of degree one less than the degree of polynomial 


Q(x) in the denominator, then write the numerator as 
numerator = a(derivative of denominator) + b, 


where a and b are some numbers. In this manner the integral becomes tractable. As an example, consider the 


integral 


| x+4 d 
x2+2x4+5 


The numerator can be expressed as 


g 2742 5 3 
a + 2x + ]+ 


Ne 


x+4= 


so that 


[= d -(s so tx? + 20 +5] +3) => Ine? +2 ++ [so 
x +axt+5 Sareea er eee }=3 me 3 x24 2x+5 


The remaining integral can be determined using “complete the square” technique as described in Trick Number 


2b. 


Trick Number 2d: Next, we consider the case in which the degree of polynomial P(x) in the numerator is more 
than the degree of polynomial Q(x) in the denominator. In this situation, it is best to use long division to simplify 


the integrand. 
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Example: Evaluate the indefinite integral, 


| ee 


x2-—2x+5 oA 
Solution: Using long division, 
Sx — 7a +17x-3 _, Pan 
x? -—2x4+5 id x2—2x4+5 0 
Thus, 
p= [es Rena p2 (Ga +| 2 dx 
= x2-—2x4+5 Ones . x2 -—2x+5 
3x? 
=a e+ 2h, 


where I, can be evaluated using “complete the square” technique as described in Trick Number 2b. 
Trick Number 3: Integrals involving trigonometric functions are simplified using standard identities such as, 
cos* x + sin? x = 1, 
sin(2x) = 2sinx cos x, 
cos(2x) = 2cos?x —1=1-—2sin? x, etc. 
Example: Evaluate = { sin? x cos* x dx. 


Solution: The given integral is 
[= | sin? x cos* x [sinx dx] = -fa — cos? x) cos? x d(cosx). 


On substituting u = cos x, we get 


3 5 


_ [a 2 ip ies cate _ cos* x | cos? x 
Sipe eg ne ae ere agro 


Example: Evaluate J = { sin? x cos? x dx. 


Solution: Since sin? x cos? x = = sin? (2x) = =[1 — cos(4x)], we get 
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I= ;{ 0 — cos(4x)] dx = al" - =“ : 


2.6 EXPONENTIAL AND GAUSSIAN INTEGRALS 


Continuing with our bag of tricks, now we look at some examples of definite integrals. Two types of integrals that 
we commonly encounter in physics are exponential integrals and Gaussian integrals. As the names suggest, the 
exponential integrals contain the exponential function, exp(— x), as a part of the integrand. The Gaussian 
integrals contain the Gaussian function, exp(—x?), as a part of the integrand. The exponential integrals, I,, are 


defined as 


eo = | x" exp(—ax) dx, Eq. (2.12) 
0 


and the Gaussian integrals, I,, are, 


i = | x” exp(—ax?) dx. Eq. (2.13) 
0 


Sometimes we also encounter Gaussian integrals over an extended range of variable x, 


gg 
= le exp(—ax?) dx. Eq. (2.14) 


In these integrals n, a positive integer, is called the order of the integral and a is a positive constant. In each case 
we will try to set up a reduction formula that relates the integral J” to similar integrals of lower order, such as 
I™1 and/or I”~2. Using this reduction formula successively, one can relate J” to simpler integrals J+ and/or I°. In 


case of the exponential integral, we have, for n = 0, 


co 


1 
Ig = | exp(—ax) dx = a 


0 


Also, forn = 1, we can integrate by parts to get the reduction formula 


(= 


—-a 


n CXP(—ax) i 


i | x" exp(—ax) dx = x a 
0 


-1 tt -1 
nx" dx =0+-I?™. 
a 
o 0 
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The first integrated term vanishes at both limits. At the upper limit, as x — 00, the factor exp(—ax) approaches 
zero faster than any increase in x”. At the lower limit, x” goes to zero since n = 1. This reduction formula can be 


used successively to obtain 


« (OMT) . CO @— 1) G2) 
[2 = —— yn? = 3 =, = A SD, 
a a a a a a aa 
or, on substituting the value of J° here, 
n! 


In case of Gaussian integrals, [7, we need L and i to use the reduction formula. We postpone the evaluation of 


i until after we have discussed curvilinear coordinates and multiple integrals in Chapter 9. The value of this 
integral, however, is, 
co 
1 ( 2) d 1 [fr 
= | exp(—ax x=-=/-. 
g P 2Va 
0 


The integral J2, on the other hand, can be evaluated easily by substituting u = ax?, 


p= [xen (-ax?) dx = : foes (-u) du = : 
97 P ~ 2a P ~ 2a 
0 


0 


To set up the reduction formula for Gaussian integrals, we use the fact 


ae ae = —2ax exp(—ax’) or x exp(—ax?) = ae eee 
dx 2adx , 


Thus, for n = 2, using integration by parts we obtain 


n Wal 2 1 nn d 2 
Ij = | x" *+xexp(—ax*) dx = oe iG "Fy CxP(-ax ) dx 
0 0 
1 fe 
= ——x""1- exp(—ax?)|P + mal (n — 1)x"~? exp(—ax?) dx Eq. (2.16) 
2a 2a 
0 
i = Al i n—-1 
=O+ “= x"? exp(—ax?) dx = ( a J pn-a, 
0 
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Using this reduction formula, we can write 17, Ij, 1g «+ in terms of Ij, and I3, i, Ij «in terms of Ij. Forn = 2m, 


where 7 is an integer, 


Sis. (2m)! P 
9 (4aym(m)! 7 


Similarly, forn = 2m + 1, 


(mo! 


2m+1 — 
Ig = 


Finally, the Gaussian integral, [j,, over an extended range of variables is 


foe) 


* 0 
C= [2 exp(—ax?) dx = | x” exp(—ax?) dx + les exp(—ax?) dx. 


0 
In the second integral, change the variable from x to y via x = —y. Then, 
0 


i= | x” exp(—ax?) dx — Jor exp(—ay*) dy = IP + (—1)" ff. 
0 00 


Thus, if n is an odd integer, then Iy9 = 0, and if nis an even integer, then Ib9 = 2 ie 
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PROBLEMS FOR CHAPTER 2 


1. Evaluate using partial fractions 


2. Evaluate the indefinite integral 


_ 
x3 —x%-—x4+1 


using the bag of tricks. 


3. Use Leibniz’s rule to find dI/dx for 


x2 
l= | (x — t) dt 
a-<z. 
witha > 0. 


Next, evaluate the integral I explicitly, and then find dI/dx. 


4. Use Leibniz’s rule to find the value of @ that provides the extremum value of the integral 


1(9) = a ( 7 . = ee 


5. Define an integral /,, = f(Inx)” dx. Using integration by parts, derive the following reduction formula: 
I, =x nx)” — nIy_3.- 


Use this reduction formula to determine 


I; = | (nx)? dx. 
,= [ana 


6. Consider the integral 


sin” x dx. 


= 
ll 
ona 
N 


Explicitly evaluate the first two integrals J, and /,. 


Using integration by parts and the identity cos? x = 1 — sin? x, derive the reduction formula for n >2: 
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Using the reduction formula, evaluate the integrals 


1/2 1/2 
I, = | sin'xdx and I, = | sin® x dx. 
0 0 


7. Consider the integral 
a 
I, = fe — a*)" dx. 
-—a 


Explicitly evaluate the first integral J). 
Using integration by parts, derive the reduction formula 


2na? 


= - 
n 2n+1 "1 


for n >1. Use the reduction formula to determine the value of 
a 
I= [e — a’)? dx. 
=f 


8. Introduce a parameter in the standard integral 


dx = arcsinx 


Free: 


and then differentiate with respect to the parameter to determine the value of 


a/2 dx 
= i (a2 — x?)3/2 . 


9. Biomedical Physics Application. A gunshot wound victim is bleeding at the rate given by 
r(t) = rot? exp(bt), 


where rp = 0.000008 ml/s?, and b = 0.004 s~1. The ambulance takes five minutes after the shooting to 
arrive at the crime scene. How much total blood has left victim’s body before the medical attention is provided 


to the victim? 
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A healthy adult has about 5 liters of blood circulating in the human body. Most adults can lose up to 14% of 


their blood before their vital signs begin to deteriorate. What would be the outcome of the victim? 


10. Biomedical Physics Application. The West Nile Virus is a disease that is contracted from infected 
mosquitoes. Scientists at the Centers for Disease Control and Prevention monitor the population growth of 
mosquitoes in a controlled environment. The mosquito population in this environment starts with 1000 


mosquitoes at the start of the summer and grows at the rate of 
r(t) = (1000) exp(0.1 t) 


mosquitoes per day. What is the total mosquito population at the end of the seventh week of summer? 


11. Biomedical Physics Application. In physiology we learn that a typical healthy adult human being breathes 12 


to 15 times per minute. The volume of air inhaled in a single breath is about 500 mL. Assuming that breathing is 


a periodic process taking 5 seconds from the beginning of inhalation to the end of exhalation, the rate of air 


flow into the lungs can be represented by the function 


is aaa (=) mL 
f= Tt sin ae eer 


Determine the total volume of the air inhaled at any time t starting at t = 0. Provide the volume of air inhaled 


by a healthy human being att = 1s, 2s,3s,4s,and5s. 
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Chapter 3: Infinite Series 


In this chapter the convergent or divergent behavior of an infinite series is investigated. Several tests for 
checking this behavior are outlined. Taylor series and Maclaurin series are described and are used to obtain an 


infinite series expansion of several simple functions. 
3.1 AN INFINITE SERIES 


Consider an infinite series 


where each element, a,,, of the series is a separate constant. For this series we define the partial sum, S(NV), as 
the sum of its first N terms 
N 
S(N) = a, +a, +++ dy =) de 


n=1 


The infinite series is said to converge if the partial sum S(NV) has a finite limit as N — 00. This limit is the value of 


the converging series; that is, if 


lim [S(N)] = S, 


N-oo 


then we write 


$= ay tay to aq t=) dn. Eq. (3.1) 


If the partial sum does not have a finite limit as N — 00, then the series is divergent. Note in passing that in the 


summation notation, the running variable n can be replaced by any other symbol as follows: 


co co co 
s=) a, or $=) dp or $=) ay, 
n=1 m=1 p=1 


Therefore, the running variable n or m or p in the summation notation is referred to as the dummy variable in 
the same sense as the dummy variable of a definite integral; that is, it serves as a placeholder that disappears 


after the sum is evaluated. To determine the convergence of a series, we compare it with another series whose 
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convergence or divergence is known. For the comparison purpose we use either a geometric series or a harmonic 


series. 
Geometric Series 
The series in which the ratio of two successive terms is constant, namely 


atax+ax? +ax?+--ax™14-. Eq. (3.2a) 


is known as a geometric series. Here a is the first term and x is the common ratio. The partial sum of first N 


terms of this series is 


S(N) =atax+ax? +--ax-V, Eq. (3.2b) 
On multiplying this partial sum by the common ratio x, we get 
xS(N) = ax +ax? +--axQ-) + ax, Eq. (3.2c) 
Subtracting Eq. (3.2c) from Eq. (3.2b) results in 
(1 —x)S(N) = a- ax’, 
or 


— spill 


S(N) = a= — Eq. (3.2) 


Now if x > 1, then S(N) > 0c as N — 00, so that geometric series is divergent for x > 1. For x < 1,x% — 0 for 


N > o5s0 that 


lim [S(N)] > — Eq. (3.2e) 


N->0©o 


Therefore, the geometric series, a + ax + ax? +--+, converges to — for x < 1 and diverges for x > 1. Note 


that in a converging geometric series, the common ratio x can take any value as long as this value is less than 1. 
Harmonic Series 


Next, we consider a harmonic series. Unlike a geometric series, which can take several values depending on the 
value of x, there is only one harmonic series. Successive terms of the harmonic series are reciprocals of the 


natural integers as follows: 
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jee te hie 
2B A n 


Even though each successive term in this series is smaller than the previous term, the series is divergent. The 


divergent nature of the series can be seen by grouping the terms as 


1+(s)+(G+)+(stetzta) + (5+ +2)+ 
2° \3" 4) °\5 "6°77 8)" \9 16 


Thus, for harmonic series, 


1+(s)+(F4+2)+(Etetzta) + (5+ +2)+ SH te 
2) °\3" 4) °\5°6'7° 8)" \9 16 279° 3 


Now, the series on the right-hand side of this inequality is clearly divergent since the partial sum of this series is 


1 N+1 
SW) = 145-1) =—— j 


and 


lim [S(N)] > ©. 


N->© 


Thus, the harmonic series is known to be a divergent series. 
3.2 TESTS FOR CONVERGENCE 


We can use several different tests to check the convergence of an unknown series. Three common tests are the 


comparison test, the ratio test, and the integral test. 
Comparison Test 


Now, to test the convergent or divergent nature of a series, we compare it term-by-term with another series 
whose convergence or divergence is known. For example, if the infinite series a, + az + a3 +-:- is known to be 
convergent, then consider an unknown series, b,; + bz + b3.... If by < ay for all values of n from 1 to infinity, 


then 1, Dy S Yin Gn and, therefore, >", b, is also convergent. 
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On the other hand, suppose the infinite series uw, + uz + U3 ... is known to be divergent. Then, consider an 
unknown series v, + v2 + v3... If vy, = u, for all values of n from 1 to infinity, then ), UV, = Yin Uy and, 
therefore, >), V,is also divergent. For this comparison test, a standard geometric series will serve as a prototype 


convergent series, and the harmonic series will serve as a prototype divergent series. 


Example: Check the convergence of the following series. 


8 


n=1 


Solution: Since — < = for any n, and vrei is a convergent geometric series with the common ratio 1/3, the 


unknown series S is also convergent. 


Example: Check whether the following series converges or diverges. 


a yee 


Solution: We compare this series with the harmonic series. Since —— 


ae >= ~forn > 3 and Yr 4 * is the divergent 


harmonic series, so S is also a divergent series. 


Ratio test of Cauchy and D’Alembert 


The Cauchy and D’Alembert ratio test provides the ability to compare an unknown infinite series with the 
geometric series 1 + x + x? + -+-. We know that the above geometric series converges only if x, the ratio of two 
successive terms, is less than 1. Thus, for the unknown series a, + @z + 3 ...+ Gy + Any, + °°, withall a; > 0, 


we look for the ratio of two successive terms, sf 
an 


in On+1 


n->c an 


< 1, the series is convergent, 


n-co An 


> 1, the series is divergent, and Eq. (3.3) 


lin = 1, the nature of the series is indeterminate. 


n-c an 


Example: Determine the converging or diverging behavior of the infinite series, 
1 ii 2 * 3 Yan 
23a ara (n+ 1)! 
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Solution: In this case, 


— Anay N+1 (n+1)! n+1 
lim — = ——— —_— = ——___ > 
n>0 A, (n+2)! n (n+2)n 


Thus, according to the ratio test, the series converges. The value of this series is 1 [see Example 3 at the end of 


this chapter]. 


Example: Check the convergence of the following series, 


n=1 
Solution: In this case, 
n+1 
an, (3) @+D! 4 
] — = ==(n+1) >, 
NO An (3) al 3 
3 


Thus, the unknown series is divergent. 
Integral test of Cauchy and Maclaurin 


Once again, consider an infinite series of constants, 


Imagine a function, f (x), that is a continuous and monotonically decreasing function of x with f(n) = ay. 


Consider its partial sum 


S(N) Sy : 


n=1 


Now consider a series of unit-width rectangles, with heights a,, a2, a3, ...etc, as shown in Figure 3.1. From the 


figure 
N 
a, +a, +++ ay_, > ) f(x)dx [dashed line] , 
1 


and 
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N 
Az +Az+-+ay < i f(@x)dx — [solid line] 
ali 


J) 


Figure 3.1. Integral test of Cauchy and Maclaurin. 


That is, 


N+1 


S(N) > f(x)dx 
and 


N 
S(N) <| f(x)dx+a,. 


On combining both the results, we get 


N+1 


f(x)dx < S(N) < [ rene +a,. 


Now take the limit N — oo to get, 


[ reoas < y an < [ Foods + ay. Eq. (3.4) 


Thus, the series )\>_, a,, converges or diverges just as the integral SP fax converges or diverges. 
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Example: Check the converging behavior of the Riemann Zeta function, 


¢(p) = _ 
n=1 


Solution: We define f(x) = = 50 that f(n) = <=. Then, 


x Pti 


f (x)dx =A xPdx=\-p+1 
z is In(x)|2,  forp=1 


I~, forp #1 


The integral i f (x) dx, therefore, diverges for p < 1 and converges for p > 1. The series for the Riemann Zeta 


function is thus convergent for p > 1. This is sometimes referred to as the p-series test. 
Absolute Convergence 


The series ))7_ @,, is said to converge absolutely if the related series }'7_, |a,,| also converges. An absolute 


convergence implies convergence but not vice versa. As an example, the series 


converges to In 2 [we will show it later in this chapter; see Eq. (3.12)]. But the corresponding absolute series, 


ieee 
2 3 4 5 ‘ 
being the harmonic series, diverges. So, the original series is not absolutely convergent. An absolutely convergent 
series is important for two reasons: The product of two absolutely convergent series is another absolutely 


convergent series. The sum of the product of two absolutely convergent series is equal to the product of the sum 


of two individual series. That is, 
SaSp + SaSe¢ = Sqa(Sp + Se): 


If an infinite series is absolutely convergent, its sum is independent of the order in which the terms are added. 
Stated differently, if an infinite series is not absolutely convergent, the value of its sum will depend on the order 


in which the terms are added. 


As an example, let us go back to the series 
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which is not absolutely convergent. Now we show explicitly that the value of this series depends on the order in 


which its terms are added. Separating out the positive and negative parts of this series, we get 


s=(1t5tet7 ) (G+ztet ) 
7 S52 do Ae eG 


The terms in the second parenthesis can be further separated out as 


sa(1+5+5+7 ) Gtetaty ) Gtetoty ) 
= head are —— see 2 6 10 14°77 4 8 12 16° . 


Now, take out the common factor of = in the second parenthesis and then easily combine the first and second 


parenthesis as 


s=(St+5taty ) Gtetoty ) 
~\o 6 10 7A. 4 8 12 16°) 


On taking out an overall common factor of <, we get 


BIR 
+ 
wlR 
on 
—— 


or 
sats 
2’ 


which, of course, is an absurd result. The absurdity arises since this series is not absolutely convergent. 


The second property of absolutely convergent series allows us to rearrange various series. Suppose S, = in-1 An 


and S, = Din-1 b, are two absolutely converging series, then S = S,,S,, is the following double series: 


S > > dn bn = > Cri 4 


m=1n=1 mn=1 


In this double sum the summation is carried over rows (and columns) of the table shown in Figure 3.2. However, 


if we define 


r=n+m-1, 
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= C,, Cis Ci, C., 
a ra 
2) Cy C.> Cog Co, 


Figure 3.2. Evaluation of a double infinite sum. 
then we note that r also takes all integer values over the range from 1 to ©. So, we can replace the dummy 


variable m by variable r. Also, notem=r—-—n+12>1 or n<r.Thus, 


aS ». Ds Cnn = » Cr—-ntin * Eq. (3.5) 


This summation is carried along the diagonal of the table in Figure 3.2. The advantage of this manipulation is that 
a double infinite sum is reduced to a single infinite sum plus a finite sum. This works only for absolutely 


convergent series. 
3.3 SERIES OF FUNCTIONS 


If each term of an infinite series is a function of a variable x, then the sum of the series, if it exists, is a function of 


x. Such a series looks like 
Ag(x) + ay (x) + an (x) + +a, (X) +0", 


with the sum of this infinite series being 


S(x) = yy a,(x) = Jim S(N;x). 
n=0 
Here S(N; x) is the partial sum, S(N; x) = )4_, a, (x). In case of series of functions, we will start the sum from 


n = 0 instead of n = 1, as done previously for series of constant terms. 


The dependence of Jim S(N; x) on x is expressed through uniform convergence of the series. If for a certain 


range of values of x, a < x < b, and for any small number € > 0, it is possible to choose a number v [which is 


independent of x] such that 
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S(x) —€ < S(N;x) < S(x) +€ for allN =v, 
or, alternatively, 
|S(x) — S(N;x)| <é€ for all N > v, 


then the series converges uniformly to S(x) in the range a < x < b. Thus, no matter how small € is, it is always 
possible to choose a sufficiently large N such that the difference between the full sum S(x) and the partial sum 
S(N; x) is less than € for all values of x between a and b. A good thing about a uniformly converging series is 

that it can be differentiated and integrated term by term. A series in x which converges uniformly in the interval 
a <x < bcan be integrated term by term provided the limits of the integration lie within [a, b]. Thus, if Six) = 


yn=0 An(x) is uniformly convergent, then 


q 4d 
[seoar=)'] A, (x)dx , Eq. (3.6a) 


provided p and q lie within [a, b]. Similarly, a uniformly convergent series can be differentiated term by term 


within its range of convergence; that is, if S(x) = iF) a,(x), then 


dS(x) _ \> da,(x) 
ibe ae dx — 


n=0 


Eq. (3.6b) 


Taylor and Maclaurin Series 


The reverse process, in which a given function S(x) is expanded into an infinite series, is called the power series 
expansion. Two power series expansions of interest to us are the Taylor and the Maclaurin series expansions. To 


gain more insight into the Taylor series expansion of a given function S(x), consider expanding this function as 


S(x) = Cy +0,(X —a) +0,(x — a)? ++ +0, (x —a)" +-", Eq.(3.7) 


where the coefficients c,, are to be determined. This is the power series expansion of S(x) about the point x = 


a. Define 


dS (x) 


dx” 


SQ) = 


and $™(a) = S"(x)|y=a- 


On setting x = ain Eq. (3.7), we note that cy = S(a). On taking the first derivative of the series in Eq. (3.7) and 


then setting x = a, we get 


dS(x) 
dx 


=O0+c¢, +2c,(x—-a) t+: 
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and 
dS 
qQ = qx lx=a = SY (a), 


Similarly, on taking the second derivative of the series in Eq. (3.7) and then setting x = a, we get 


d’s 
Txt 21 Men +3 2e3(x — a) ++ 
and 
_1d’s _ SP) 
2 dx? x4 ~ 2 


Similarly, on taking the third derivative of the series in Eq. (3.7) and then setting x = a, we get 


_ ids Sa) 
“3 = 31 dxs =a ~ 3 
In general, 
_ id's _ 5S (a) 
fn =~ xn x=a 


Substituting the values of c,, in Eq. (3.7) provides the Taylor series expansion, namely 


co 


i) 
SG) = y — (x-a)”. Eq. (3.8a) 


n=0 


This is the Taylor series expansion of S(x) about x = a. In general, the Taylor series can be truncated after N 


terms as 


N-1 


(n) 
S(x) = 2 saa (x-—a)"+ Ry 


n! 


n=0 


s)(p) 


7 (x — a)", for any b between a and x. The Taylor series expansion is 


where the remainder Ry is 


convergent only if Ry ie 0. 
00 


Fora=0, 


62 


co 


(n) 
S(x) = yy 2) re Eq. (3.8b) 


n! 


n=0 


which is purely a power series expansion of S(x) and is called the Maclaurin series for S(x). 
Example: Determine the Maclaurin series expansion of S(x) = exp x. 
Solution: In this case, we note that S™ (x) = exp x and 50) = 1 for all n. The Maclaurin series becomes 


n 2 x3 


x x 
expr=) altrtptot~ Eq. (3.9) 


n=0 
The convergence of the series can be checked by the ratio test since ant = — —> 0asn- o and x is finite. It is 
n 


worth commenting that we can make expansion of exp x about any point (other than zero) using Taylor series. 
Example: Determine Maclaurin series expansions of sin x and cos x. 


Solution: In this case, 


f(x) =sinx, f(0)=0 ; g(x) =cosx, g(0) = 1, 
f(x) =cosx, f%(0) =1 ; g(x) = -sinx, g™(0) =0, 
fP'@) =—sing, 7 O) = 0; g(x) =—tosx, gO) =—1, 
fe) = =tosx, fP'O)==1 3 g(x) =sinx, g®(0) =0, 
FOR) Ssinay, (YP O)=0; g(x) = cosx,g(0) = 1. 


Thus, in general, for n = 0,1,2---, 
pO) a0, fo) (3 g?™(0) = (-1)", g?"*9(0) = 0. 


Use these values in 


“FM 9  g™(0 
fo= yt Oa and g=)* ©) vn 


n n! 


to obtain 
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co 
x3 x? x2nti 


see ae ee) CU ee Eq. (3.10) 
n=0 
a xe ee x2n 
cosx = ae (ny! Eq. (3.11) 
n= 


Example: Determine Maclaurin series expansion of S(x) = In(1+ x). 


Solution: In this case, S(0) = 0 and 


SY (x) = (1+ x)7? s%(0) =1 
SA (x) = -(1 +x)? $0) =-1 
SO (x) = 2(1 + x)73 5 (0) = 2! 
SM (x) = -3!(1 + x)74 50) = -3! 


SM (xX) = (-D™1M-1!IA4+x)™ nB1 SMO) =(-D)™1*M-1)! nZ1. 


Thus, the series for In(1 + x) is 


~ (je x x 
Sq) = In +x) = ) = ax -S45- T+ Eq. (3.12) 
n=1 


Note in passing that S(1) = In(2) = 1— + - = - + +++ as mentioned earlier in this chapter. 
Example: Determine Maclaurin series expansion of a binomial function (Binomial Theorem), namely 
f(x) =(1+x)" 
where m is a real number (positive or negative; integer or non-integer), and |x| < 1. 
Solution: In this case, various derivatives of f(x) are: 
Pf" @ amex 
f"O) =mn= Dasa)" 


f2(~) = mm - 1)(m —- 2)(1 + x)? : 
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f™(x) = m(m—-1)(m—-2)...(m—n+1)(1+x)™™. 
On setting x = 0, we get 
f™(0) =m(m—-1)...(m—n + 1). 


Thus, the Maclaurin series for the binomial function is 


3! 


f(x) = (1+ x)™ = 1 tmx + MED y? 4 MO St Eq. (3.13) 


As a particular case, if m is a positive integer, say N, then Nth derivative of f(x) is 
f™() = N(N -1)(N -2)...01) = M, 


which is independent of x. So, all higher derivatives starting from f"+ (x) onwards are zero. In this case, the 
infinite series expansion of the binomial function (1 + x) reduces to a finite series. For N = 2,3,4... we have 


the well-known expansions 

(1+x)? =1+2x+x?, 

(1+x)? =14+ 3x4 3x? 4+ x3, 

(14+ x)* =14+ 4x 4+ 6x? + 4x3 4 x4, 
etc. 


Additional Examples 


If a series of functions is a uniformly convergent series, then it can be differentiated and/or integrated to reduce 


an unfamiliar series to a familiar one. A few examples of this trick are discussed here. 


Example 1: Consider the series S(x), 


S(x) = 14+ 2x4+ 3x7 4+ 4x3 +++ 


where the range of convergence of this series is known to be —1 < x < +1. Integrate the series term-by-term 


obtain a geometric series, 


x 


x 
[ seodx = 242? 48 tat t= ; 
0 1-x 


Now differentiate both sides of this expression to get 


to 
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e _d ia arog x - 1 
=o =e) "a5 Gan? Gee 


Example 2: Consider another series S(x), 


Ve oes pe ee 
= +33 *34'Gs 
then, 
x2 x8 xt x8 
25 a OO he 
WSO) =T5 +55 +34 ta5t 


On taking the second derivative of this series, it reduces to a geometric series, which can be easily summed as 


d? 1 
Ses] <1 tx 4x8 tx8 bo 


1-x 


Now, we integrate this expression twice to get S(x). The first integration gives 
© (25(x)) =—-In(1-—x)+C 
dx o 
and the second integration gives 
x?S(x) = - f inca —x)dx+C,x+C,. 


Here C, and C, are two constants of integration. The indefinite integral here can be evaluated using the 


substitution, y = In(1 — x), or equivalently, x = 1 — exp y. Then, 


[ in -29 ax = | y{-exp yldy = -yexpy + | expydy = (1- y) expy = (1—x)[1—-—In(Q1—-x)] . 


So, 


x*S(x) =-(1-—x)[1 -n(1 —x)] + ¢C, «4+. 


To evaluate the two constants of integration, we set x = 0 and recall that 


In(1 — x) = —-x 


2 
27 kates 


to obtain 
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Cy, =1. 


Substitute this value of C, in the above expression for S(x) and then divide both sides by x to get 


xS(x) =1 


Pcie 


Again, set x = 0 to obtain C, = 0. So, finally, the value of the original series S(x) is 
ey ee etre 
x)= ¥ oi n( x). 


In some cases, even if the series does not contain a variable, one can still use the trick of differentiation and/or 


integration by introducing a variable judiciously. 


Example 3: Consider the series 


d 3 x4 nl 
=x+¢x*+—4+—4+--+—— +: 
de Oa (n— 2)! 
2 3 xn-2 
axfrext Dod tf sree 


On integrating both sides, we get 


x 


[fe = f(x) — f(0) = I xexpx dx. 


x 


f(x) = xexpx|5 - | expx dx = xexpx —expx+1=1+(x-1)expx. 
0 


Then, the value of the original series of constants is S = f(1) = 1. 
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PROBLEMS FOR CHAPTER 3 


1. Find the value of a (a > 0) if 


or Atay 

— (1+ — an hae 

2. Use the comparison test to check the convergence of following two series. 
>. 32 and » Va 
n=1 n=1 


3. Use the Cauchy and D’Alembert ratio test to check the convergence of following two series [Useful 


information, e = 2.72]. 


- n exp(—n) and fp? 
n=1 n=1 : 


4. Use the Cauchy and Maclaurin integral test to check the convergence of following two series. 


tee 14+ n? 
n=1 n=1 


5. Show that the first five terms in the Taylor series expansion of exp(—x) cos x about x = 0 are 


) = 1 + oe st is 
exp(—x) cos x = x+y>-Etag-: 


6. Expand the integrand below using binomial theorem 


1- 


x 
a" (4) =(— = 
(=) 
0 


and integrate term by term to obtain the power series expansion of 


1+x 
n( ) for |x| <1. 
1-x 
7. By comparing with other well-known series, evaluate the sum of the following series exactly 


(In3)? (In 3) 
2! 3! 


1+1n3 + 


8. The classical expression for the kinetic energy of a particle is KE = m, v* where m, is the mass of the 


particle when it is at rest and v is the speed of the particle. When the particle is moving with speed comparable 
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to c, where c is speed of light, the motion of the particle is correctly described by the theory of relativity. 


According to this theory, the mass of a particle moving with speed v is 


Mo 
d 
[1 vu 
C2 


and the kinetic energy of the particle is the difference between its total energy mc? and its energy at rest m,c?, 


m= 


that is, KE = mc? — m,c?. 


(a) Using binomial expansion show that when v is very small compared to c, this expression for the kinetic 


energy of particle agrees with the classical expression. 


(b) The leading relativistic correction term for the classical expression for the kinetic energy is of the 


4 
form am, aa What is the value of a? 


9. Biomedical Physics Application. In the thirteenth century, Italian mathematician Fibonacci investigated the 


growth of rabbit population by assuming, 
(a) rabbits never die, 
(b) rabbits can mate at the age of one month with the gestation period of one month, 
(c) the mating pair produces a new pair every month starting at the age of two months. 


Starting with a newly born pair of rabbits, what is the number of rabbit pairs at the beginning of nth month? 


Here is the accounting: 

At the beginning of first month, there is only one original pair, a1 = 1. 

At the beginning of second month, the pair mates, but there is still only one pair, a2 = 1. 

At the beginning of third month, there is original pair and a newly born pair, a3 = 2. 

At the beginning of fourth month, original pair produces a new pair and the one-month-old pair mates, a4 = 3. 


At the beginning of fifth month, original pair and two-month-old pair reproduce while one-month-old pair 


mates, as = 5, etc. 


At the beginning of nth month, number of pairs, a,,, equals the sum of newly born pairs [which is equal to the 
number of pairs at the beginning of (n-2)th month, a,,_>] plus the number of pairs living at the beginning of (n- 


1)th month, a,_,. Explicitly, the first few terms in the Fibonacci series look like 


14+14+24+34+5+4+84+13+21+.., 
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with the recursion relation a, = Qyj_, + Qy_2. 


(i) If lim “4 = 2, then show that a = “8%, 
no an 


(ii) Prove the following five relationships for elements of the Fibonacci series, 


The Fibonacci series appears in several biological settings, such as number of leaves on a stem, number of 
branches in a tree as a function of height, designs of seashell or pinecone or artichoke or pineapple fruitlets, 


seeds in a sunflower, family tree of honeybees, etc. 


Interlude 


In this short interlude we introduce a few mathematical bits and pieces which will be helpful in our future 


journey of mathematics. We will introduce a unit imaginary number i, Kronecker Delta 6;;, Dirac Delta function 


yj 


6(x), Levi-Civita symbol €;;, and Euler’s formula. 
Unit Imaginary Number: i = ¥—1 


The square root of some positive numbers is easy to figure out and remember, such as V4 = 2,V¥25 =5,etc. For 
other positive numbers, we may need a calculator, such as V3 = 1.732, V7 = 2.646, etc. But, our beloved 
calculator is not very helpful when we attempt to find out the square root of some negative numbers. For 


example, what is ¥—4 or ¥—7 ? For this purpose, we define the square root of —1 as a unit imaginary number, i. 


In terms of i we can write V—4 = ./(—1)(4) = 2ior V—7 = ¥(—1)(7) = 2.646 i. Various powers of i are i? = 


—1,i? = i#i = —i,i* = i2i? = 1. Using the fact that it = 1, we can reduce i”, where 7 is a large number, toa 
much simpler form. As an example, i = i16+1 = j or (135 = (33@)+3 = (3 = —j or (206 = (8442 = j2 = -1, 
etc. 


Kronecker Delta: 6;; 


Kronecker Delta is a symbol with two indices, i and j, and it can take two possible values, 0 and 1, depending on 
the values of the indices. The two indices, i and j, themselves can assume all possible discrete integer values 


from —oo to +00. Specifically, 
1ifi=j 
by =| mee Eq. (I.1a) 
Using this definition of Kronecker Delta, we can write 


». f (i) Oy = f (x). Eq. (1.1b) 


{=—0o 
Dirac Delta Function 
In the definition of the Kronecker Delta above, its two indices, i and j, can assume all possible discrete integer 
values from —oo to +0. Its analog in which these two indices can take all possible continuous values is referred 


to as Dirac Delta function. To make this connection more understandable, we rewrite the Kronecker Delta in an 


alternate notation as 
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oC =O if ty 0 (i,j =+++—2,-1,0,41,42-+ ] Eq. (I.2a) 


along with 


». f(x) 6G - jf) = f(x). Eq. (I.2b) 
i=—0 
Now, if 6 is generalized from discrete variables i and j to all possible continuous values, x and y, then the Dirac 


Delta function should have the properties: 


é(x—-y)=0ifx-—y+#0 or simply 6(x)=0 ifx+#0, E£q.(.3a) 


as well as 


| f (x) 6(x% — y) dx = f(y) or simply | 6(x)dx=1. £Eq.(I.3b) 
The Egs. (J. 3a and I.3b) imply that 6(x) cannot be zero for x = 0 because then the integral (which represents 
the area under the 5(x) versus x curve) will be zero, not 1. In order to understand the value of 6(0), consider a 


step function F(x), of width 2€ around x = 0 and height 1/(2e), 


0 for |x| >e 
1/(2e) for €>x2z-€ 


FOS { 


The area under this step function (or, the integral of the function F(x) over x from —oo to + 00) is 1. On taking 
the limit € — 0 in F(x), the step function will have value 0 for x # 0, value co for x = 0 and will preserve the 
area under the F(x) versus x curve to be 1. Thus, our functional definition of the Dirac Delta function, d(x), is 


any function which satisfies the following three conditions: 


d(x) =0 if x #0, d(x) =0 if x=0 and | d(x) dx = 1. Eq. (U.4) 
It is easy to show that the following limiting relationships, as € — 0, satisfy all three defining conditions of the 


Dirac Delta function and, therefore, can be taken as definitions of the Delta function itself: 


Le € 
5(x) Se ‘ Eq. (1.5a) 
1 i 
5(x) = —= lim—exp(—x?/e?) , Eq.(1.5b) 
Vt e>0 € 
ie SEs Fee 
6(x) = = lim = sin iS). Eq. (I.5c) 
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There are two comments worth mentioning here. First, unlike the Kronecker Delta which is a dimensionless 
number, the Dirac Delta function 6(x) has dimensions of 1/x for the integral of Eq. (1.4) to be meaningful. 
Second, the Dirac Delta function defined in Eq. (1.4) is a one-dimensional Delta function which can be generalized 


to two-dimensional and three-dimensional Dirac Delta functions. 
Levi-Civita symbol: €;;; 


Analogous to a Kronecker delta, one can define the Levi-Civita symbol €;;, with three indices (ijk). The three 
indices (ijk) can take only three values (1 or 2 or 3) each. The Levi-Civita symbol itself can have one of the three 


possible values, +1 or 0 or -1. Specifically, 


0 if any two indices are equal 
€ijx = ) +1 if an even exchange of (ijk) gives (123) Eq. U.6) 
—1 if an odd exchange of (ijk) gives (123) 
Then, out of 27 (3 times 3 times 3) possible Levi-Civita symbols, only six are nonzero. These are €,53 = 
+1, €432 = —1, €913 = —1, €93, = +1, €312 = +1,and €32, = —1. As we will see later, one can write the cross 
product of two vectors in a very compact form using the Levi-Civita symbol. Also, the value of a determinant of 


order 3 can be written using the Levi-Civita symbol. 


The Levi-Civita symbol is related to Kronecker delta by 
>: EijkEitm = 5j1 5m ie Orr: Eq. (1.7) 
i 
A nice mnemonics device to remember the order of indices in this relationship is illustrated in the figure below so 


that, right-hand-side of the relation is (first)(second) — (outer) (inner). 


J Rk Ll em 
FIRST 
SECOND 


Figure I.1. Mnemonic for the order of indices in the Epsilon-Delta Identity 
This relationship between Levi-Civita symbols and Kronecker deltas (called epsilon-delta identity) is very useful in 
deriving several identities of vector calculus as well as several properties of determinants. A proof of the epsilon- 


delta identity is given in Appendix C. 
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Euler’s Formula 


From Chapter 3, we recall the Maclaurin series expansion of some simple functions, such as: 


_ id (-1)"x2" 
cosSx = (ny! 
n=0 
as (-1)"x2"41 
aia » “(2n+1)!’ 
n=0 
co et 
expx = a 
n=0 


Now, on replacing the dummy index n by 2m in the first sum and n by 2m + 1 in the second sum, we get 


72mM+1,,.2m+1 


Sy j2My2m yj x 
a » (am)! * > (2m+1)! 
m=0 m=0 


_ ed (-1)™x2™ ; id (-1)™x2mt1 


(2m)! Ge (Qm+!’ 


m=0 


or 


exp(ix) = cosx +isinx. Eq. (1.8) 


This is known as Euler’s formula. By many accounts, Euler’s formula is the most beautiful equation of 
mathematics, or the jewel of mathematics. When written in the form e” + 1 = 0, it is made up of five different 
mathematical constants (namely, O, 1, 7, e, and i) each having its own independent value. Euler’s formula will be 


very helpful in our further studies of Fourier series, Fourier transform, and complex variables. 
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PROBLEMS FOR INTERLUDE 


1. Using properties of the Kronecker delta, show that 


and 


2. Using properties of the Levi-Civita symbol, show that 
> Eijk Eijm = 26km) 
ij 
and 
> Eijk ijk = 6. 
ijk 


3. Using Euler’s formula, show that 


cos x 
2 


exp(ix) — exp(—ix) 


sinx = 
21 


_ exp(ix) + exp(—ix) 


? 
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Chapter 4: Fourier Series 


Any periodic function can be expressed as an infinite series of sinusoidal (sine or cosine like) functions. This series 
is called the Fourier series of the periodic function. For a general nonperiodic function, the extension of the 

Fourier series leads to the Fourier transform. In this chapter, we will use orthogonality relationships of sinusoidal 
functions to derive coefficients in a Fourier series. We will also show how the Fourier series becomes the Fourier 


transform on increasing the periodicity of the periodic function. 
4.1 PERIODIC FUNCTIONS 


In preparation for our discussion of Fourier series and the Fourier transform, let us first get over some 
mathematical preliminaries. In our earlier discussion we derived an equation, the wave equation, whose solution 


describes any periodic function. In particular, we noted that a sinusoidal function 
f(x, t) = sin [kx + ot] 


satisfies the wave equation with k = 27/A,and w = 27/T. We also noted that a periodic function that repeats 
itself both in time (t) as well as in a spatial coordinate (x) is not a function of variables x or t separately but is a 
function of dimensionless variables kx + wt. In general, a periodic function f (@), of variable 0, repeats itself as 
the variable changes. The “periodicity” of such a repeating function is defined as the range of variable 8 over 
which the function repeats itself. For example, if f is a function of time, it repeats itself after an amount of time 
called its period, T. If f repeats itself over space, then the length over which the function repeats itself is the 
wavelength, A. So, we can define periodicity for these periodic functions as either T or A. As another example, 
the periodicity of a simple sinusoidal function, sin 6 or cos @, is 27. The integral of these functions over a 
complete range of their periodicity is the same no matter what the starting (lower limit) or ending (upper limit) 


points of the integral are. Specifically, 


a+2T 21 a+27 a 21 
sino dé =| sinodo + | sino dé — | sina do =| sina do, 
a 0 21 0 0 
a+27 21 a+27 a 21 
| cos a0 = | cosa.d0 + | coséao | cos@ao = | cosas, 
a 0 2m 0 0 


since in both cases a cancellation of integrals occurs in the middle step (on replacing the dummy variable @ by 
another variable w using 9 = 27 + w). Recall that a product of any two sinusoidal functions can be written as a 


linear combination of single sinusoidal functions, namely, 
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sina cos B = 5 [sina —B)+sin(a + B)], 
sina sin B = 5 [costa — B)-—cos(a + B)], 
cosacos Pp = 5 [costa — B)+cos(a + B)]. 


Thus, in general (with p and q integers), 


at+20 21 

| sin(p@) sin(q@) d@ = | sin(p@) sin(q@) dé , 
a 0 

a+27 2m 


cos(p@) cos(q@) dé = | cos(p@) cos(q@) dé , 


a+2T 21 


cos(p@) sin(q@) dé = | cos(p@) sin(q@) dé . 


Orthogonality Relations 


Recall that for an even function of 0, f(—@) = f(@), and for an odd function, f(—@) = —f(@). In our discussion 
of Fourier series and the Fourier transform, we will encounter integrals whose integrands are products of either 
two sine (ss), two cosine (cc), or a cosine and a sine (cs) functions. The complete range of periodicity of these 
functions, namely 272, is taken to be from — 2 to +7: so that sin @ is an odd function and cos @ is an even function 


of 8 over this complete range. Assuming that p and q are nonzero integers, we have 


1 


Io5 = | sin(p@) sin(q@) dé 


1 


1 { 1(sin(p+q)@  sin(p — q)@)|" 
=-—- | dO{cos[(p + q)0] — cos[(p — q)@]} = see 
7a {cosl(p + q WEES ee, ea - 
={° if p#q 
“la if p=q 
on ies = 107,- Eq. (4.1a) 


Similarly, 
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1 
Lo = | cos(p@) cos(q@) dé 


—1 


if von — 2 fsin@ +96 , sin — q)0)" 
= 5 | aotcostip + a)6] + cost(p - gay) =5{— PO" , OP } 


TT 


m if p=q 
Oh ice —Wo,,- Eq. (4.1b) 

Finally, 

™T 
i | costa) sin(q@) dé 

= 

i 1 (p+q)@ cos(p—q)0)" 

; ; cos(p + q cos(p — q 
=. | dO{sin[(p + q)@] —sin[(p — q)@ =e eee Eq. (4.1 
5 | ao{sin{(y + 496) lo - De} => aa = a q. (4.1c) 


=f 
= 0 for all integer values of p and q. 


We will refer to the three integrals /,.,/,, and I, as the orthogonality relationships for the sine and cosine 


functions. As an aside, we recall that Euler’s formula relates sine and cosine functions to exponential functions 


Thus, we can write an orthogonality relationship for exponential functions. More specifically, 


ie | exp(ip@) exp(—iq@) dO = | {cosa) + isin(p@)][cos(q@) — isin(q@)] dé 


| cos(p@)cos(q@) dé + | sin(p@)sin(q@) dO = 7 bpq + T Ong = 2S yg. Eq. (4.2) 


—1 


Jt 


Periodic Functions of Spatial Coordinate, x, and Time, ft. 


If the periodic function under consideration is a function of a spatial coordinate x, with periodicity of 2, then we 


replace 6 in the above examples by kx. Substituting 9 = kx = 2mx//, we get 


a 
sin(pkx) sin(qkx) dx = 3 Opa (p,q #0), Eq. (4.3a) 


I5. = 


| 
Nji~“—— NI 


a 
; a 
i | cos(rke) cos(qkx) dx = 3 Spa (p,q # 0), Eq. (4.3b) 
a 
“2 
and 
a 
2 
ia | coscrkex) sin(qkx) dx = 0, Eq. (4.3c) 
a 
“2 


for all integer values of p and q. 


If the periodic function under consideration is a function of time t, with periodicity of T, then we replace 6 by 


wt. Substituting 9 = wt = 2nt/T, 


ua 
F T 
lhe = | sin(pawt) sin(qwt) dt = 5 pa (p,q # 0), Eq. (4.4a) 
+ 
ul 
r T 
log = | cos(pat) cos(qwt) dt = 7 ona (p,q # 0), Eq. (4.4b) 
+t 
and 
ii 
z 
_— | cos(pwt) sin(qwt) dt = 0 for all integer values of p and q. Eq. (4.4c) 
- 
4.2 FOURIER SERIES 


Now, look at the three functions of variable 6 shown in Figure 4.1. Clearly these functions are periodic functions 
with periodicity of 27. The first function f,(@) is an even function of 6 since f,(@) = f,(—@). The second 
function f,(@) is an odd function of @ since f,(@) = —f,(—@). Finally, the third function f3(@) is neither even 
nor odd since there is no direct relationship between f;(@) and f,(—@). According to Fourier, any arbitrary 


periodic function f(@) can be represented as a linear combination of sinusoidal functions as: 


f(@) = > + > Ancos(mé) + ys bm sin(mé) Eq. (4.5) 
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Figure 4.1. Three periodic functions of 8: f,(@) is an even function of 0, f2(@) is an odd function of 8, and f3(0@) is neither even nor odd 
function of @. 


which is known as the Fourier series. Here Qo, d,,,and b,,are Fourier coefficients. Note an additional factor of % 
with coefficient aj. The mystery of this factor will become clear after we evaluate ay). Given a periodic function 

f (8), we can obtain its Fourier series by simply calculating its Fourier coefficients. To obtain these coefficients, 

we multiply Eq. (4.5) by either cos(p@) or sin(p@), where p is a nonzero integer, and integrate over 0 from 


-71 to 7m to get 
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1 


| cos(p6) f (6)d0 = = | cos(p6) dé 


—1 


© 1 foe) 1 
+ b) Am | costa) cos(m@) dé + » bn | costa) sin(mé) dé 
m=1 -1 m=1 -1 
=O+ b An tom, +0 =ap7, 
m=1 
or 
1 1 
Om = | cosmo) f(@)de m= 1. Eq. (4.6) 


Se: 
Similarly, on multiplying Eq. (4.5) by sin(p@) and going through similar steps, 
™T 


—— 2 | sin(mé@) f(@)dé m= 1. Eq. (4.7) 


—1 
Finally, integrating Eq. (4.5) directly, 


ao r 
Sf d0+0+0=agr 


—1 


ow = 


or 


1 1 
ag=— | fede, Fq.(48) 


which is the same as Eq. (4.6) with m = 0. The reason for associating an extra factor of % with ay was to ensure 
that evaluation of this coefficient using Eq. (4.8) is similar to the evaluation of a,,, (m = 1) using Eq. (4.6). This 


completes the evaluation of Fourier coefficients ay, a,,,and b,, for a general periodic function f(@). 
Even and Odd Functions 


Further simplification of Fourier coefficients occurs if the function f(@) is either an even function or an odd 
function of 8. An even function of variable @ is symmetric about the origin, namely, f(—@) = f (@), whereas for 
an odd function, f(—@) = —f(@). First, consider the Fourier coefficients b,,,. Split the integral in Eq. (4.7) into 


two parts as 
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bm = 2 f sincmo) f (@)d6 pe { snne) f(@)déd =1, +1. 
m 1 1 fares | ZS 


0 —1 


In the second integral, substitute y = —@ or 8 = —y, and note that for an even function 


1 0 
[sin(-my)]f(-y)d(—y) = -| sin(my) f(y)d(y) = —h. 


Tw 


| 
alr 
sparen 


So, b,, = 0 for an even function f(@). Similarly, for an odd function, I, = 1, so that b,, # 0 and it is given by 


1 


2 
bm = -| sin(mé) f (@)dé. 
0 
Next, consider the Fourier coefficients a,,. In this case, splitting the integral of Eq. (4.6) into two parts gives 
ag 1f¢ 
a= -| cos(m@) f (@)dé soe [ cosmo) f(@)d0 =J, +Jo. 


0 -—1 


Again, in the second integral, substitute y = —@ or 8 = —y, and for an odd function, f(—@) = —f(@), so that 


fe 1 T eas(mpIf eae 1 f cosemoy doh = =] 
a y y y y. yay) = 


1 


Thus, for an odd function, a,, = 0, including ag. On the other hand, for an even function, J, = J, so that a,, # 0 
and it is given by 
1 
2 
an = -| cos(m@) f (@)dé. 


0 


This is really not a very surprising result since it implies that for an odd function [f (—@) = —f(@)], the Fourier 


series consists of only odd sine functions: 


f(0) =) bm sin(md) , 


m=1 


while for an even function [f(—@) = f(@)], the Fourier series consists of only even cosine functions: 
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f@) = = + >: AmCOs (m@). 


Here are examples of determining Fourier series for three different periodic functions. The first function is an 
odd function of 8, the second function is an even function of 8, and the third function is neither an even nor odd 


function of 0. 
Example: Given a periodic function, 
f(0)=0 for-n <O0<T1 [Sawtooth function], 


determine its Fourier series. 


Figure 4.2. The sawtooth function, f(@), as a function of @. 


Solution: As seen in Figure 4.2, f(@) is an odd function of @. So, all a,, = 0 (including ay), and 


f(@) = > bm sin(mé) 


with 


bm = 2 sintmo) (046 = 2 fo simone) dé 
"md mi! 
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- “|. 6 cos(mé)|” Fi (2 | 


TT m m 
; 
0 


= east) = (-1ym 2 
m m 


0 0 


sin(m@) 


_2 1 Gio 
re aS MIT “we 


Thus, 
f(@)=90=2 >. Ss sin(mé), 


ens ae Ae. Det sa 
=2 {sin(6) = 7 sin(28) + 3 sin(39) = qsin(49) + } 


This is the Fourier series for the sawtooth function. Figure 4.3 shows the sawtooth function as well as the sum of 


first N terms of the Fourier series (red dotted line) for N = 2,5,10, and 100. 
Clearly, the depiction of the sawtooth function by the Fourier series becomes more accurate as N increases. 


Example: Find the Fourier series for a periodic function f(@) which is defined in the interval-7 < 0 < mas 


f(0) = 07. 


Solution: Since f(@) = 97 is an even function of 0, b,, = 0 form = 1. Multiply both sides of the above function 


by cos né@ and integrate over 0 from — z to 7 to get 


1 1 (oe) Tw 
a 
[ro cos(n@) d@ = = | cos(n@) dé + >. Am | cos(m@) cos né dé 
—1 —1 m=1 -1 


=0+7Aa, 
or 


27 26? sin(n@)|" 
ay ==] 6 cos(nd) d= =— = 
0 Q 0 


Tw 
2 i 0 
== f 20 a 
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Figure 4.3. The sum of first N terms in the Fourier series of a sawtooth function. 
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cos(n@) 
n 


"4 r cos(n@) 


nit n 
0 


dé 


Ae 4 
= -— | 6 sin(n@) dd = —8@ 
nm nT 
0 


=~ ale cos(nn)| — Pig y”. 


Next, integrate the given function from - 7 to 7 to get 


1 1 foe) 1 
ag 
| ero -3 | 20+ yy mn | cos(mayae 
=IC —1 m=1 -—1 
or 
63\" ay 2 a 2 
— = — = Bu 20s = — 2 
Z| 76| +0 or 5m 3 2m OF Ag ra 
Thus, 


f(@) = a + ~ ae cos(mé). 


m2 


Example: Determine the Fourier series for a unit step function defined as 


_ £0 for —-1w7<0<0 
f@) = {, for 0<0<z 
Solution: The function f (@) is neither an odd function nor an even function of 9, so the Fourier series for f (0) 


will consist of all Fourier coefficients, ay,a,,,and b,,, namely, 


co 


f(@¢) = = + De Am cos(mé) + >. bm sin(mé) 


m=1 m=1 


We evaluate these coefficients using Eqs. (4.6) to Eq. (4.8), 


Lp ‘ep 1 


T 
=0 m2=1 
0 


1 


ee [ sinomoy @yao =f sinomo) ie - 1 n--p 
mt 1 o£ > mm : 


m 
—1 


or, 


0 for m = 2,4,6,8 ... 
=i 2 ‘ 
Pm {2 for m = 1,3,5,7 ... 
™m 
So, the Fourier series for f (0) is 
sin(@ sin(30 sin(50 sin(70 
y= eR 0G 


Complex Fourier Series 


Since Euler’s formula relates sine and cosine functions to exponential functions, it is possible to write the Fourier 


series as a series of exponential functions instead of sine and cosine functions, 


(62) ge) ea) 


m=1 


= + YG) roc + YS) oxime 


m=1 


On changing m to - m in the second sum, we have 


a 
C= > 
ae : form = 1,2 , 
and 
= om FO form = 1 —2,++— © 
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In terms of these new coefficients, the Fourier series becomes 


co 


f(@= >» Cm exp(im@) . Eq. (4.9a) 


m=—00 


This is the complex (or exponential) form of the Fourier series. The coefficients c,, are obtained by using the 


orthogonality relation [Eq. 4.2] for exponential functions, 
1 ™ 
Ca. = = | f(@) exp(—imw) dw. Eq. (4.9b) 
—1 


Example: Let us repeat the sawtooth function, 

f(@) =9 for —-w<O<T7. 
using the complex Fourier series. 
Solution: 


The Fourier coefficients for this complex series are, 


1 1 
1 ; 1 \0 exp(—im@) - exp(—im6) 
Cm = 5 J eexo(-ima aa = = Pern a ee form #0, 
a —1 
i exp(—imé) 


_ 1 (@exp(—imé) 
ioe 


J 


at - exp(—imm) (-—7)exp(imm)  exp(—imm) | 
a 


—] —] 2 
im = (—im) 


—im —im (—im)? (—im)? 


CDP. te" 
m 


1 i 
= ——cos(mzm) — —-ssin(mz) = -— orm # 0. 
im cro) mm? on) im f 


Form = 0, 


So, the complex Fourier series looks like 
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i(—1)™ 


exp(im@) + >. exp(im@) 


m=-1 


f(@) =9=0+ y 


On changing m to - m in the second sum, we have 


f(@) = ye ews exp(im@) + >< oe se  exp(—imo) 


m=1 


= = = ee [exp(im@) — exp(—im@)] = 2 a — _ sin(m8), 


m=1 


the same as before. 


4.3 FOURIER TRANSFORMS 


In our discussion of Fourier series, our motivation was to write any arbitrary periodic function as a series of some 
well-known periodic functions—namely, sine and cosine functions. A question arises: Can we write something 
analogous to a Fourier series for an arbitrary function that does not appear to be a periodic function? Let us 


explore this possibility. Look at an isolated step-like function in space as shown in the Figure 4.4a. 


Figure 4.4a. An isolated step-like function in space. 


At first sight this function appears as a non-periodic step function in space. Now, imagine starting with a periodic 
step function, as in part a of Figure 4.4b, of some definite wavelength as indicated. If we were to remove 
alternate spikes in part a of this figure, we will end up with part b, which still shows up as a periodic function, 
except with a longer wavelength. We repeat the process of removing alternate spikes in part b and end up with 
part c, which also shows a periodic function but with a still-longer wavelength. If we continue the process of 
removing alternate spikes from each new figure ad infinitum, we will end up with the isolated function of Figure 


4.4a. 
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KA > a 


K—A—>| b 
—1..-—> c 
d 


Figure 4.4b. A periodic function with an infinite wavelength appears as an isolated function. 


This procedure suggests that an equivalent of the Fourier series of an isolated function, of spatial coordinate x, 


will be obtained by taking the limit of an infinite wavelength (A > 00). Using complex Fourier series, with @ = 


kx we have 
f=) cnexplimkx), 
where 
A/2 
— 1 4 
in = 5 | FB) exp(—imkap) ap. 
-A/2 
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The dummy variable, m, in the infinite sum above goes from —oo to +00 in steps of 1, that is, Am = 1. Now we 


make a change of dummy variable in this infinite sum from m to q, defined as q = mk = m(2m/A). Then, 


Note that values of q are discrete with step size 277/A. When A > ©, the sum over m becomes an integral over 


q, and Aq — dq. Then, 


ee) Aa/2 
A 
f(x) = lim | expliqn) =~ | f(’) exp(—igx’) dx’, 
mor q=-0 —A/2 
or 
1 f a T 4 if I 
fod = 5 | da | FO) expliae - x'yIdx. 
If we define 
F(q) == [ry (-iqx’) dex’ 
a x’) exp(—igx’) dx’, 
q Vin . pl—lq 
then 


1 [oe} 
f(x) = rl dq F(q) exp(iqx). 


F(q) is called the Fourier transform of f(x) and vice versa. The factor of 1/27 can be distributed in many ways 
between F and f. However, it is a common practice to distribute 1/V 27 with function f and 1/V 27 with function 


F to preserve symmetry between a function and its Fourier transform. 


Let us then summarize the relationship between a function and its Fourier transform. Consider a pair of two 
physical quantities, uw and v, whose product is dimensionless and, therefore, can be measured in radians. Such 
variables are called conjugate variables. We have seen an example of this in our discussion of the wave equation, 
namely the pair of wave number k and spatial coordinate x such that the product kx is dimensionless. Another 
pair consisting of angular frequency w and time t is such that the product wt is dimensionless. Since u and v are 
conjugate variables, the Fourier transform “transforms” the u-dependent function f(u) into a completely 


equivalent representation F(v), a v-dependent function, in the following way: 
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1 co 
F(v) = ie | f (uw) exp(-iuv) du. 


F (v) is called the Fourier transform of f (uw). The inverse Fourier transform creates the opposite transformation, 


namely, 


fWwe= F(v) exp(iuv) dv. 


va 


Thus, f(u) is the Fourier transform of F(v). In particular, a function of spatial coordinate x (position in space) is 


cast into an equivalent function of wavenumber k (momentum space) by the Fourier transformation, 


1 
F(k) -7a | f (x) exp(—ikx) dx, Eq. (4.10a) 
and its inverse, 
are: fr k ikx) dk Eq. (4.11 
f(x) "Van J (k) exp(ikx) dk. q. (4.11a) 


As an aside, k-space is also called momentum space (or, p-space) since in quantum physics p = hk, where his 
the universal Planck’s constant. Similarly, using the Fourier transformation, a temporal function of time t is cast 


into an equivalent function of frequency w as 


1 co 
F(w) “7a | f(t) exp(—iwt) dt, Eq. (4.10b) 
and its inverse, 
(t) = : fe ) exp(iwt) d Eq. (4.11b 
ff "Pas w) exp(iwt) dw. q. (4.11b) 


Example: Determine the Fourier transform of a Gaussian function f(x) given in the coordinate space as 


x? 1 x? 
f(x) = f(0) exp (- =) 7 aoe (-3). 


Solution: The constant in front, namely, f(0) = 1/(avr), normalizes the Gaussian function as 
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[ rooae = 1. 


The integral here is same as integral Le of Eq. (2.14). The bell-shaped function f (x) has its maximum value, 
f (0), for x = 0 and it goes to zero as x — +00. The extent of values of x, namely Ax, over which the 


function f (x) is appreciable is given by the full width of the function at half of its maximum value (FWHM). Thus, 


Ax = FWHM = 2avin2. Eq. (4.12a) 


In other words, the parameter a in the Gaussian function is a measure of the approximate width of the function; 
the larger the parameter a is, the wider the Gaussian function, and vice versa. Figure 4.5 shows the function 


f (x) for a = 1,2,and 3. 


Gaussian Function, f(x)/f(O) = exp(-x7/a’) 


aes 

2 

is) 

c 

2 °” 

- exp(-x~/1) 
S ° 

s —====— exp(-x Re 
S169 #7 FINS jj qcaadivomme exp(-x/9) 
O 


Figure 4.5. The Gaussian function f(x)/f(0) = exp(—x*/a?) for various values of a. 


The Fourier transform of f (x)/f (0) is 


F(k) ee exp ae exp[—ikx] dx. 
Van J Pa 


On perfecting the square in the exponent, we get 


4a? 


1 1/,. >... Cika®)?\  (ika?)? 
FQ) == | exp —a\* + ika*x + 7 + dx 
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1 se exp k7a “| [« p|- ee ae 


7 2 
On making a change of variable from x to u, using u = = 


1 
Se a - 


exp(—u7) adu, 


a a 
= —exp ]—- ——_], 
| 4 


on using the integral iy of Eq. (2.14). We note that the Fourier transform F(k) of a Gaussian function in the 
coordinate space is also a Gaussian function, but in the k-space. The extent of values of k, namely Ak, over 
which the function F(k) is appreciable is again given by the full width of this function at half of its maximum 


value (FWHM). Thus, 


4 
Ak = FWHM =—Vin2. Eq. (4.12b) 


Now the parameter a has an inverse relationship to the width of the function F(k); the larger the parameter a 
is, the thinner the function F(x). Note, the product of Ax and of Ak in Eq. (4.12) is independent of a. This 


essentially is the statement of the Uncertainty Principle in quantum physics. 


Example: Determine the Fourier transform of a single rectangular pulse in the coordinate space looking like, 


1 
=~ -ascx<a 
f(x) =4(@a) 
0, |x| >a 
Solution: For this function, 
co a 
[ rooue=2 far=1 
f(x)dx = aa x=1. 
—00 -a 


Furthermore, the extent of the values of x over which the function is appreciable is, 


Ax = 2a. Eq. (4.13a) 


The Fourier transform of the single rectangular pulse is 
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co a 
1 1 1 
Fdo === | x) exp(—ikx dx = | ex —ikx) dx 
CO Sg J PD exwtikay d= Fe | exm(—te) 
_ 1 1 exp(-ikx)|* i. 4 i (ika)] 
= 7594 ik | ~~ 2aik JE exp(—ika) — exp(ika 
_ 1 1 di sin(kayl = 1 sin(ka) 
= 2aik Jaq isin( Ol ee a ; 
Figure 4.6 shows F'(k) as a function of the variable k. It is a wiggly function with the central wiggle being the 
largest one, extending from k = —1/a to +7/a. 
152, 
1.0 
0.8 
y 0.6 
x 
— 04 
2 
c 0.2 
7) 
0.0 
-0.2 
-0.4 
-40 -3% -2% -1% 0 1m 2m 3m At 


ka (in radians) 
Figure 4.6. The function sin(ka) /(ka) as a function of ka. 


Thus, the extent of the values of k over which F(k) is appreciable is, 


21 
Ak = oa. Eq. (4.13b) 


Once again, the product of Ax and of Ak, in Eqs. (4.13), is independent of a, which confirms the statement of the 


Uncertainty Principle in quantum physics. 


From examples of Fourier transforms, it can be concluded that if the extent of a function is very wide, then the 
extent of its Fourier transform is very narrow and vice versa. This fact is very useful in defining the reciprocal 


lattices in condensed matter physics. 
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PROBLEMS FOR CHAPTER 4 


1. Show explicitly, for p and q as positive integers and a as an arbitrary constant, 


a+27 21 
sin(p8) sin(q@) dé = i sin(p@) sin(q@) dé, 
a 0 
a+27 21 


cos(p@) cos(q@) dé = | cos(p@) cos(q@) dé, 


a 


a+2T 21 


sin(p@) cos(q@) dé = i sin(p@) cos(q@) dé. 


a 


2. Find the Fourier series for a periodic function f(@) which is defined in the interval-7 < @ < mas 


_ jute for -m < 
fO= {og for O< 


oo 


3. Determine the Fourier series for the “square” wave defined as 


a 
—1 for—5<x<0 
f(x) = 


a 
1 for0<x<-5 


This series appears in discussions of high frequency electronic circuits. 
4. Determine the Fourier series for the function defined as 

f(t) = |sinwt| for —m < wt <7. 
This series appears in discussions of the full-wave rectifiers in electronics. 
5. Determine the Fourier series for the function defined as 


sinwt for 0<wt<aZ 
t)= 
F() { 0 for -mt<wt<0 


This series appears in discussions of the half-wave rectifiers in electronics. 


6. Consider a periodic function f (@) which is defined in the interval-7z < @ < mas 


1 
0 


IA IA 


(a) ls the function f(@) an even function or an odd function or neither? 
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(b) Determine the Fourier series for f(@). 
7. Find the Fourier series for a periodic function f(@) which is defined as 
f(@)= |0| for -x <6 <7. 
8. Show that the Fourier transform of the function 
TUX f es 
f= {e°*(za) or —-a<x<a 
0 otherwise 


(where a is a constant) is 


a TU 
F(k) = Vin (— eae aa cos(ka). 


Plot f(x) as a function of x and F(k) as a function of k. Using these plots, estimate the extents Ax and Ak of 
the functions f(x) and F(k), respectively. Show that Ax times Ak is independent of a. 

{Hint: Use cos x = [exp(ix) + exp(—ix)]/2. } 
9. Show that the Fourier transform of the parabolic function 


_ (a2? — x? for |x|<a 
Dy to for |x| >a 


a3 
F(k) = — 


4 
lew [sin(ka) — (ka) cos(ka)]}. 


Plot f(x) as a function of x and F(k) as a function of k. Using these plots, estimate the extents Ax and Ak of 
the functions f(x) and F(k), respectively. Show that Ax times Ak is independent of a. 
10. Show that the Fourier transform of the triangular pulse function, 


_ (1-|x\l/a for |x| <a 
fa= {5 for |x| >a 


(where a is a positive constant) is 


_ a fsin(ka/2) 2 


Plot f(x) as a function of x and F(k) as a function of k. Using these plots, estimate the extents Ax and Ak of 


the functions f(x) and F(k), respectively. Show that Ax times Ak is independent of a. 
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Chapter 5: Complex Variables 


In this chapter we will describe algebra related to complex numbers. Representations of complex numbers in 
two-dimensional planes will be explained with several examples. Algebraic properties of addition, subtraction, 
multiplication, and division of complex variables will be outlined. Finally, DeMoivre’s formula will be derived and 


applied to obtain some useful trigonometric identities. 
5.1 COMPLEX NUMBERS AND COMPLEX ALGEBRA 


A complex number is defined as an ordered pair of two real numbers (x, y) in which the first number, x, is called 

the real part of the complex number and the second number, y, is called the imaginary part of the complex 

number. It is customary to reserve the letter z for complex numbers and write the complex number as 
Z=xtiy. Eq. (5.1a) 


The real part of z, namely Re(z), is x and the imaginary part of z, namely /m(z), is y. Complex numbers can be 
represented as points in a two-dimensional plane, which is analogous to the common x- y plane and is called the 
complex plane or the z-plane. Because of its similarity with Cartesian coordinates, the representation of a 
complex number as z = x + iy is called the Cartesian representation of the complex number. In a plane one can 
switch from Cartesian coordinates (x, y) to plane polar coordinates (r, 8). From Figure 5.1, 


x=rcosé,y=rsin6. Eq. (5.2a) 


Thus, a complex number can be written as 


z=r(cos@ +isin@) = rexp(i@). Eq.(5.1b) 


Here r is the magnitude of z and @ is the argument or phase of z. The magnitude of a complex number is also 


commonly written as |z| so that 7 = |z]. Note, 


r= /x?+y?,0 = arg(z) = arctan (-). Eq. (5.2b) 


The representation of a complex number as z = |z| exp(i@) = r exp(i@) is called the polar or exponential 


representation of the complex number. 


The complex conjugate of a complex number is obtained by replacing i by -i and is denoted by placing a bar on 
top of the number. Thus Z = r exp(—i@) or Zz = x — iy. Note |z| =r = |Z|. Also zz =r? = |z|? =x? + y?. 


Thus, 
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|Re(z)| < |z| and [Im(z)| < |zI. 


z - plane 

Z=x+iy 
~-_~ 
N 
c 
R= 

Re (z) 
Z=x-ly 


Figure 5.1. Cartesian and polar (or exponential) representation of a complex number. 


Example: Given, z; = 2 + 5i, determine Re(z,), Im(z,), |Z1|, phase angle 8, complex conjugate Z; and 


exponential representation of Z;. 


Solution: Looking at the complex number z, = 2 + 5i, we note Re(z,) = 2, Im(z,) =5, |z,| = v2? +54 = 29, 


tan@ = ae 2.5, 0 = 68.2° = 1.19 radians, Z, = 2 — 5i, and z, = ¥29 exp(1.19 i). 


Example: Given z, = 4 + 3i, determine Re(z,), Im(zz), |z2|, phase angle 0, complex conjugate Z>, and 


exponential representation of Z>. 


Solution: Again, looking at the complex number z, = 4 + 3i, we note Re(z,) = 4, Im(z,) = 3, |Z| = 


V42 +34=5, tan@ = = = 0.75, 8 = 36.9° = 0.64 radians, Z, = 4 — 3i, and z, = 5 exp(0.64 i). 
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5.2 PROPERTIES OF COMPLEX NUMBERS 


Here is a list of some general properties of complex numbers. 


Property Number 1: Addition or subtraction of two complex numbers is achieved by adding or subtracting the 


real parts and the imaginary parts separately. If 


Z, =X, tiy, and Zz, = xz + iyo, 


then 


2, £2, = (4%, £X2) +101 + 2). 


The addition of two complex numbers is both commutative as well as associative, 


Zy +22 =2Z,4+72,, 


Z, + (Z2 +Z3) = (4, + 22) +23. 


The product of two complex numbers is treated as simple polynomial multiplication, 


ZyZy = (X1 + iy) (X2 + ive) = (%1X2 — WrY2) + 1X V2 + X2Y1). 


The product of two complex numbers is commutative and associative as well as distributive, 


2422 = 222), 


21 (2223) = (4122)23, 


Z1(Z2 + 23) = 2,Z_ + 2423. 


Property Number 2: |f two complex numbers, z, = x, + iy, and Z, = x2 + iy2, are equal, then their real and 
imaginary parts are separately equal. In other words, z, = Z, implies x; = x, and y, = y2.Ifz =x +iy =0, 


then it implies that both x = 0 and y = 0 simultaneously. 


Property Number 3: \f 2,Z, = 0 it implies that either z, = 0 or z, = 0 or both are zero. To prove this fact, we 


note 


ZyZq = (X1 + iy) (X2 + ya) = (% 1X2 — WzV2) + i(Xyy2 + X21) = Oz 


Thus x1X2 —¥1¥2 =O and x,y. + xX2y, = 0. Therefore, 
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(1X2 — WiV2)? + (X12 + X2V1)* = 0, 
or 
xt (xz + yz) tytOg +7) = 0, 
or 
(xt + yi)(xz + yz) = 0. 


Thus, either x? + y? = 0, which implies both x, =0 and y, = 0,thatis, z, = 0,orx? + yz =0, 


which implies both x, = 0 and y, = 0, thatis, z, = 0, or both z, and Zz, are zero. Note in passing, 
12422] = {0x3 + v7) Og + yZ)}'? = lz Ilzel. 


Property Number 4: Division of one complex number by another complex number works as follows. If z, = x, + 


iy, and Zz, = X, + iy, # 0, then 


2% tly, 242, _ (x1 + ty, ) (x2 — ty2) _ (x2 + V1V2) 1X2 — X1Y2) 


= a 2 2 2 2 2 2 
Zag Xg4Fly2 2222 xz + V3 xz + V3 x2 + 32 


Property Number 5: The properties of complex conjugation imply 


Z,42,=2, tz, 


222 = ZZ, 


E\-2 wen, 


@ =z 


If Z = z then z is pure real, that is, Im(z) = 0 and if Zz = —z then z is pure imaginary, that is, Re(z) = 0. In fact, 


Z+Z Z—Z 
Re(z) = pe? Im(z) = op 


Property Number 6: lf z = r exp(i@) = x + iy then 6 = arg(z) = arctan(y/x), and 


arg(Z,Z2) = arg(z,) + arg(z2), 


ane) = ace) ~ arte. 
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Example: In this example, we take the two complex numbers that we used in the previous two examples, 


namely, z; = 2 + 5iand z, = 4 + 3i, and illustrate some of the properties that we discussed above. 


Solution: 


|z, + z,| = V62 + 8? = V100 = 10, 


Z,+2Z, =6+4 8i, 
Zy—- 22 = -24+21 |Z, — Z.| =./(—2)? + 2? = vB, 


ZZ) = (2 + 5i)(4 + 3i) = 8 + 201 + Gi +: i215 = —7 + 26i, 


|z,Z5| = J/(—7)2 + 262 = V49 + 676 = V725 = V25 * 29 = 5V29 = |z,||zol, 


2 Se Deere ose os Be 
Zz 2+5i2—-5i. 29 +229 + «29” 


1 be ae eG et 3 


Se tl, 


ZZ &+3i4—31. 25 25 25 


Here is a list of some items which are so important that one should commit them to memory. The useful 


things to remember, for k an integer (positive or negative) or zero, are: 
exp(im/2) =i 


exp(—im/2) = —i 


exp(ikm) = (—1)* 


exp(i2km) =+1 


Based on these useful items to remember, Property Number 2 for two complex numbers, in exponential form, 


can be written as follows. If z; = r, exp(i@,) and z, = r, exp(iO,), then z, = z, implies 7, = 7, and 6, = 6, + 


2k, where k is an integer including zero. 


In some situations, dividing by a complex number or finding the reciprocal of a complex number is best 


accomplished by using the exponential form of the complex number instead of using Property Number 4. The 


following example will illustrate this point. 
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Example: Write the complex number z = 4i/(V3 + i) in its Cartesian form. 


Solution: We write the numerator and denominator of the complex number z in exponential form as 


4i 4 exp(im/2) in in = i 
° V3+i 2exp(im/6) ( 2 ~) exp(im/3) (cos 3 +isin 5) + iv3. 


5.3 POWERS OF COMPLEX NUMBERS 
In order to calculate powers of a complex number, Zz, it is best to work with exponential notation, namely, 
Zz” = (x+ iy)” =r" exp(inég). 
Example: Given z = ; + is determine all higher powers of z. 


Solution: First, we write z in its polar or exponential form. Since r = [= + - = Land @ = arctanvV3 = a it 


follows that in exponential form 
ae (=) 
z= lexp\=). 


Then, 


Im (z) : 
z=! +193 = 1.0 exp(in/3) 


Figure 5.2. Powers of a complex number z = 1 exp (=). 
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2a (=) 
Z” = Lexp zs 7 


z> = 1exp(zi), 


67 . 
z®° = 1exp (=) = lexp(2mi) = 1. 


Since z® = 1, all higher powers of z, starting with z” onwards, take one of the six values above. These six values 


are shown in a complex plane in Figure 5.2. They lie on a circle of unit radius. If the magnitude r is different from 


1, then we get a spiral instead of a circle. For example, for r = 1.1, 


44 (=) 
z= 1.lexp|>], 
z* = 1.21 exp(ni), 


31i 
z? = 1.331 exp (=), 


z* = 1.4641 exp(2zi), etc. 


Im (z) 
z= 1.1 exp(in/2) 


= 
iN 
pte 


N 


Figure 5.3. Powers of a complex number z = 1.1 exp (=). 
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In the complex plane these values appear on an ever-expanding spiral, as seen in Figure 5.3. Similarly, if the 
magnitude of a complex number z is less than 1, that is r < 1, then all higher powers of z lie in the complex 


plane on an ever-contracting spiral. Also, for r # 1, all higher powers of z have a separate distinct value. 
Example: Given z = 1 + i, what is 2°? 


Solution: First, write the complex number z in polar or exponential form. Note r = V1 + 1 = V2 andtan@ = - 


so that 0 = 17/4. So, 


im 
z=1+i=VZexp(+). 


Then, taking any higher power of z is straightforward, 
z= (v2)° exp (8 *) = 2* exp(2mi) = 16. 
However, the long method would be to multiply 1 + i by itself eight times: 
(141)? =14+2i+i? =14+2i-1 =2i, 
14+0)*7=0+4+070 +i? = 2D(2) = 4i? = -4, 
(14+i0)% =(1+i40 4+? = (4)? = 16. 
5.4 ROOTS OF A COMPLEX NUMBER 


Roots, such as square root or cube root, of a complex number are also evaluated most conveniently by using 
exponential form of the complex number. As a practice run, let us evaluate roots of +1 and —1. From algebra we 


know that a polynomial of order n, 
1+a,z+a,z7+--+a,z" =0, 


where coefficients a; can be real or complex, has exactly n roots. Some roots may be repeated. Thus, roots of a 


simple polynomial 1 — z” = 0 are the n roots of unity or +1. To obtain these roots, write 


Zz” =4+1=exp(2mki) k = 0,1,2... 
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Then, Zz = exp (= i) , which for different values of k provides n roots of unity. For k = 0,1,...(n — 1), these 


roots are, 1, exp (=) ,exp (=) ,exp (=), . @XP (Fe) . The next value of k, namely k = n, gives back the 


n 


first root, that is, z = 1. 


Similarly, from z” +1=0 or z” = —1 = exp(ai) exp(27ki) = exp[m(2k + 1)i], the n roots of —1 are 


exp (=) ,exp (=) ,exp (=) ya XD (=) . The next value of k, namely k = n, gives back the first root, that 
is, Z = exp (=). 
Example: Determine all possible values of square roots of +1. 
Solution: In this case, 

z*—-1=0 or z*?=1=exp(2mki) 

or z = exp(zki) = z, for k = 0,1,2,.... 
The mathematical symbol = means that it is an equality for any value of variable k. Thus, for k = 0 and 1, 

Z) = +1andz, = —1. 

Higher values of k simply keep repeating values of Z) and Z,, that is, 


Z,y = +1 fork = 2,4,6,... and z, = —1 fork = 3,5,7,... 


Thus, there are only two independent square roots of +1 and their values are +1. In the complex plane, these 


two roots of +1 lie on a unit circle as seen in Figure 5.4. 


Im (z) 


'k=0, 2, 4,... 
+> 
k=1,3, 5... ' Re (z) 


Figure 5.4. Square roots of +1. 
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Example: Determine all possible values of cube roots of +1. 


Solution: In this case 
z7-1=0 or z?=1=exp(2zmki) 
2u 
or, Z=exp (xi) = z, for k = 0,1,2, ... 


Zy = +1, 


2T © 2m\  . . (2n\ —1+iv3 
Z, = exp (=i) = cos (=) + tsin([) = 


Im (z) 
\ 
\ 
\ 
nd A a ae 
of be : ™ 
; ff 4 Re (2) 
‘ ¥ ‘ 
, / 
r=2,5.2% 
? 
J 
ph 
/ 


Figure 5.5. Cube roots of +1. 


Ar. An, .. (4m\ —1-iv3 
Z2 = exp (=i) = COS (=) +tsin (=) = a 


Since z,; = exp (27i) = +1, which is same as Zo, the values of all z, for k = 3 are repeated. So, there are only 
three independent cube roots of +1, namely, Zo, Z,,and Z,. Again, these three cube roots of +1 are seen to lie on 


a unit circle in the complex plane as in Figure 5.5. 


Example: Given z = 4i, what is z!/2? 


Solution: First, convert z into its exponential form, r = V0 + 16 = 4 and tan@ = 7 = oor@ = 7/2, s0 that 
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z= 4exp (‘5) exp(27ki) 


z/2 = 2 exp(im/4) exp(mki) = z, for k = 0,1,2..... 


Fork = 0 and k = 1, we get 


1 1 2 2 
Zo = 2(cos> + isinz) = 2(F +i) = V2(1+ 2), 


ae 


= 2{ oF + isin) =2 
Z, = 2\cos[-t+isin[-) = ae bs 


7 )- —V2(1 + i). 


For higher values of k, k = 2,3, 4,..., the roots Z) and z, are repeated. Thus, z = 4i has only two independent 


square roots, +V2(1 + i). 
Example: Given z = 1 — V3i, what is Vz? 
Solution: Again, converting the complex number from its Cartesian form to its polar or exponential form, we get 
z=1-V3i = 2exp(—im/3) exp(2mki). 
Then, 
z/2 = V2 exp(—im/6) exp(mki) = z, for k = 0,1,2,.... 


For k = 0 and k = 1, we have 


Zs = V2 (cos= — isin=) = v3($-i5) - fo “ 


yy 3 V3 1 3 1 
aa sAosS an 8) a(S id) oi 


Again, for higher values of k, k = 2, 3,4, ..., the roots Z) and z, are repeated. Thus, 


z = 1—V3i has only two independent square roots, +f Fi = 
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5.5 DEMOIVRE’S FORMULA 


In the Interlude section we introduced the Euler’s formula, namely, 
exp(ix) =cosx +isinx. 
If we put x = n@ in Euler’s formula, we get 
exp(in@) = cos(n@) + isin(né) . 
Also, 
exp(in@) = (exp(i@))" = (cos@ +isin@)” . 
So, 
(cos 8 + isin 8)" = cos(n@) + isin(n@) Eq. (5.3) 

which is known as DeMoivre’s formula. 

Applications of DeMoivre’s Formula 


In some situations, it becomes useful to write powers of a sine or a cosine function, such as sin" 6 or cos" @ 
[integer n], in terms of sine or cosine of various multiple angles like sin(m@) or cos(m8@) [integer m]. This can be 
accomplished by a simple application of DeMoivre’s formula. This formula also allows us to write 

sin(n@) or cos(n@) [integer n] in terms of multiple powers of sine or cosine, as sin™ 8 or cos™ 6 [integer m]. 


For convenience, write 
z=cos@ +isin@ = exp(id), 
then 
z1 = [exp(i0)]~1 = cos @ — ising, 
and z” = exp(in@) ,z~" = exp(—iné). 
Then, 


z" +27" = exp(in@) + exp(—in@) = 2 cos(né), Eq. (5.4a) 


and 


109 


z” —z-" = exp(in@) — exp(—in@) = 2isin(né). Eq. (5.4b) 


These relationships along with DeMoivre’s formula allow us to accomplish what we set out to do. 
Example: Write cos?@ and sin‘@ in terms of sine or cosine of angles which are multiples of 0. 


Solution: Using Eqs. (5.4), 


: ro ee | 1 
cos?@ = = ria +3z4+3z14+2z%)= g (lz +273] + 3[z+2z74)) 


[cos(36) + 3 cos 6] 


1 
[2 cos(38) + 6cos 8] = 7 


1 
8 


and 


4 
z—-z 1 
sin*@ = (=F) — Taal —4277+6-—427°74+27%) 


7 (2 +z-*] —4[z? + z-?] + 6) 


1 
ae [2 cos(40) — 4-2 cos(26@) + 6] 


[cos(40) — 4cos(20) + 3]. 


a 
8 
Example: Write cos(46), sin(46@), cos(20), and sin(26) in terms of multiples of sin 8 and cos 6. 
Solution: Starting with DeMoivre’s formula, 
cos(46) + isin(4@) = (cos@ +isin@)* 
= cos*@ + 4 cos? (isin) + 6 cos76 (isin@)* + 4cos@ (isin@)? + (isin@)* 
= (cos*@ — 6 cos?6 sin?6 + sin*@) + 4i (cos?6 sin @ — cos 6 sin?6) 
Separating out the real and imaginary parts on both sides leads to 
cos(40) = cos*@ — 6 cos76 sin?6 + sin*6, 


and 


sin(40) = 4cos@sin@ (cos*@ — sin7@). 
Similarly, 
cos(29) +isin(20) = (cos@ +isin@)* = cos? + 2cos@ (isin@) + (isin)? 
= (cos?6—sin76) + i2 sin cos 0. 
Separating out real and imaginary parts, we get well-known relations 
cos(20) = cos?6—sin76, 


sin(20) = 2sin@ cos 6. 
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PROBLEMS FOR CHAPTER 5 


1. Show that for any two complex numbers z, and z, 
|z, + z,|? + |Z, — z.|? = 2|z,|? + 2 |z,|? 
2. Given a complex number, 


_ 5 +i10 
** G=id@-i’ 


determine the real and imaginary parts of z. 


3. Determine the real part, imaginary part, and the absolute magnitude of the following three expressions, 


iaCilni Vi d Di (—) 
sin(ilni), i an 7m Tria)" 


4.\f z =x + iy, determine the real part, imaginary part and the absolute magnitude of the following three 


functions, 
(a) sinz, (b) cosz and (c) exp(iz). 
5. For two complex numbers z, = V3 + i and z, = —V2 + iv2, determine 
(a) |2,/22|* and (b) (2,/22)*. 
6. Find all roots of the equation z© — 1 = 0 and plot the results in a complex plane. 
7. The three cube roots of +1 are of the form 1, w and w?. Determine Re [w] and Jm [a]. 
8. The three cube roots of —1 are of the form a, a? and a>. Determine Re [a] and Im [a]. 
9. Determine the fourth root of z = —8 + i 8V3. 
10. Determine the third (or, cube) root of z = —2+i2. 
11. Use DeMoivre’s formula to express sin? 9 and cos‘*@ in terms of sine and/or cosine of multiples of 0. 


12. Use DeMoivre’s formula to express cos(3@) and sin(3@) in terms of powers of sin @ and cos @. 
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Chapter 6: Determinants 


In this chapter we will first define a determinant as well as its minors and cofactors. Next, we will outline a 
procedure, due to Laplace, for determining the value of the determinant in terms of its cofactors. Several 
properties of determinants will be described which help in simplifying the determinant so that it can be reduced 
and evaluated easily. Two final examples will help in understanding the utility of properties in the simplification 


process for a determinant. 
6.1 DEFINITION OF A DETERMINANT 


A determinant is a square array of numbers (or elements) such as 


G41 x2 aay Tn 
21 Az2 Gey Fan 
ey in Se 
Qin Aig ij Qin |’ 
Ant Ang + Anj “* Ann 


which are combined together according to certain rules to provide the value of the determinant. The element aj; 
is located at the intersection of i*” row and j*" column. The number of rows (or columns) is called the order of 
the determinant. The determinant shown above is of order n. We also define the minor and the cofactor of each 
element of a determinant. Starting with a determinant of order n, one can obtain a smaller determinant by 
omitting one row and one column. The determinant of order (n — 1) obtained by omitting the row and column 
containing aj; (that is, omitting if? row and j** column) is called the minor M;; of a;; and the minor multiplied by 


a sign of (—1)'7/ is called the cofactor C;; of a;;. 


Let us start evaluating some determinants. A single number can be considered as a determinant of order 1 and 


the value of this determinant is the value of the single number. 


Next higher order determinant is of order 2. Consider a determinant of order 2, 


Qi | 
Q21 422 


a= | 


According to the rules of determining the values of a determinant, the value of a determinant of order 2 is the 


difference between the products of elements along two diagonals as 


det(A) = 41422 — 21Q42 
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Note, if we interchange the rows and columns of this determinant to form a new determinant B, 


Q4, Ag, 
B=| } 


442 422 
then 
det(B) = det(A). 
Next higher order determinant is of order 3. A typical determinant of order 3 is 


Q11  Ay2 43 
Q21 422 423 
431 32 433 


A= 


Again, according to the rules of determining values of a determinant, the value of a determinant of order 3 can 


be written in terms of Levi-Civita symbols as 


det(A) = > €i jx i Yo4ez = €123911A22A33 + €132A11A32A23 + €213A21A4233 F €231421A32043 
ijk 


TF €3124314 2423 1 €321431A22013 = > Ej jk 241192 j;93k- 


ijk 


The last equality suggests that the value of a determinant of order 3 is not changed if its rows and columns are 
interchanged. In other words, if a determinant B is obtained from A by interchanging its rows and columns, 


then, 
det(B) = det(A). 
Substituting explicitly the values of all Levi-Civita symbols, we get 
det(A) = 44142233 — Ay1A32A23 — Az1Q4233 + Az1A32Q43 + A31A42A23 — Az1A22043 - 


Note that each product on the right-hand side contains only one element from each row and from each column 


of the determinant. Furthermore, factors on the right-hand side can be rearranged either as 


det(A) = a1 (22433 — A32423) — Az1(A1233 — A,3A32) + A31(G12423 — A43A22) 


a a a a a a 
_4yi141 |%22 S| _4y\241 | 12 | 4341 | 12 | 
ay1( 1) la A33 + Ap1( 1) a32 A33 a a31( 1) Az2 Az3 


y4(—1)*** My, + @g1(—1)?** Mp, + 31(—1)7**M3, 
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A414 Cyy + Aq1Co1 + A31C31, 


or as 


det(A) = a4, (22433 — A32423) — Ay2(A21433 — A31A23) + A13(Az1Q32 — A312) 


a a a a a a 
_4y141 |%22 | _4y142 | 21 = 4143 | 21 = 
a44( 1) las A33 + Qy2( 1) A341 a33 a 43 ( 1) a3, A32 


y4(—1)*** My, + ay2(—1)"*7My + a43(-1)* 7 My, 


A414 Cyy + Ay 2Cy2 + Ay3C)3. 


These latter forms suggest an alternate way, due to Laplace, of obtaining the value of a determinant of any order 


n as 


n 


det(A) — >: ainCi, 


i=1 
where the expansion of the determinant is across first column, or as 


n 
det(A) = » ay; Cij, 
i=1 
where the expansion of the determinant is across first row. It suggests that rows and columns of a determinant 
can be interchanged without affecting its value. In fact, the value of a determinant can be obtained by expanding 


across any row or any column. We illustrate this fact in the example below. 


Example: Determine the value of the following determinant of order 3, by expanding it across any row or any 


column. 


1 2 1 
A=]0 -1 2). 
1 1 O 


Solution: Expanding across first row, 


devay=(y-y | |+@cor|? J+a@coel) 7 


1 
= 1(0 — 2) —2(0 —2) +100 + 1) = 43. 


Expanding across second row, 
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det(A) = (0)(-1)?*" [4 al + (D(a? |} Ws cays |! i 


S001) 101) 90 2) = 43, 


Expanding across third row, 


gil? A peel 2 nant 2 
deta = (D-D |" +@Mcyo|, |+@Co*|, 4 
= 1(44+1)—1(2-—0) + 0(-1 + 0) = 43. 
Expanding across first column, 
me ee vole: A aoe At 
detay=(D-|5° lt+O@cv|, f+@cn|4, 3 


= 1(0 —2) —0(0-—1) +1(4+ 1) = 43. 


Expanding across second column, 


det(A) = (2)(-1)"*? |) “| + (-1)(-1)*? |} Ue acy |t u 


=-2(0-2) -1(0= 1) —12—0) =43. 


Expanding across third column, 


were cauae 1 <grea{l 2 eel 2 
deta =(D-D|, FY |+@cvrs|, f+@cn|, 4 
= 1(0+ 1) —2(1 — 2) + 0(-1 — 0) = 43. 
Thus, in general, 
» Din Cir = » Cyi Ae; = oF det (A) b Eq. (6.1) 
Kk Kk 


6.2 PROPERTIES OF DETERMINANT A 


There are several very useful properties of determinants which are helpful in determining the value of a large 
determinant. Since the value of a determinant does not change on interchanging the rows and the columns, all 


the remarks below for columns of a determinant will apply equally well to the rows of the determinant. 


As a shorthand notation for writing the determinant, let us write A, for the /‘” column of A. Then 
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A = |Aq, Ag) Ap An |: 


Property Number 1: If any two columns of a determinant A are interchanged, then the value of the resulting 


determinant B is same as the value of the original determinant except for an overall change of sign. In other 


words, if 

A= |A,, Az. Ay us Ams+-An |, 
and 

B= |Ay, Aq Am, Ap An I 
then 


det B = — detA. 


To prove this fact, we first assume that the /£” column and the m®" column of A are two adjacent columns, that 


is, 

A= |A,, A>, ...Apy Amy An | 
and 

B= |Ay,A2 Am Ap «An |: 


Now, to determine the values of these two determinants, let us expand each determinant across the column that 
contains elements of the original [{" column. Each element of this column will have the same minor both in A 
and in B; however, the cofactor of each element will differ in sign only since the *" column is moved from left of 
m* column to the right of m®”" column. So, det B = —det A. This result is true even if the J{" column and the 
m*" column of A are not adjacent columns. If the ‘” column and the m*" column are separated by j columns, 


such as 
A= |A,,A,..4,<—jcolumns >,A,,...An |, 


then a total of (27 + 1) interchanges of adjacent columns are required to exchange the I‘” column and the m‘”" 


column and since (—1)*/*? = (—1) for any j, the fact that det B = —det A is proved. 


Example: A determinant B is obtained by interchanging the first two columns of another determinant A as 


shown below: 
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1 -1 1 -1 1 1 
A=|2 3 -2]/andB=/}3 2 -2). 
3 1 -4 1 3 -4 


Show that det B = —detA. 


Solution: Expanding across first row, we get 


deta) =(y(- fF “4+ (pcp |? —2]4acn 2 


1(-12 + 2) + 1(-8 + 6) +1(2 —9) = -19 


and 


2 nS 
3 ne 


3 - 


det(B) pane 


(-1)(-1)""| 21+ (y(-| 21+ cys [3 ; 


1(-8 + 6) — 1(-12 + 2) + 1(9 — 2) = 419. 


Property Number 2: If a determinant has two identical columns, such as 


A= |AyAp Ay Ap An |; 


then detA = 0. 


The proof of this statement easily follows from Property Number 1, since on interchanging the two columns we 


get a negative sign, while the determinant stays the same since the two exchanged columns are identical. Thus, 


det(A) = — det(A). 


det(A) =0. 


Example: Two columns of a determinant A are identical. Determine the value of this determinant. 


1 -1 -1 
A= |2 3 3 |. 
3 #1 1 


Solution: Expanding the determinant across first row, we get 
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det(a) = (1(-D | ‘|+ (pcp [2 314 (cy 3 


= 1(3 —3)+1(2-9) -1(2-9) =0. 


Property Number 3: If each element of a column of a determinant is multiplied by a constant k, then the value of 


the whole determinant is multiplied by the same constant k. Or, saying differently, multiplying a determinant by 


a constant k amounts to multiplying any one of the columns of the determinant by k. 


Suppose determinant B is constructed by multiplying the J” column of determinant A by a constant number k. 


Then 
A= |A,, Az... Ay An | 
and 
B= |A,,A,...KA),...Ay |. 
Also, 
n 
det(A) = » aj C; 
i=1 
and 


n 


det(B) = > Gea Cu = ky ayCy = kdet(A). 


t=1 t=1 


Example: A new determinant B is formed by multiplying the third column of another determinant A by a 


constant number k = 5, as shown: 


1 -1 1 1 -1 5 
A=|2 3 -2]/andB=|2 3. -10). 
3 1 -4 3 1 —20 


Show that det B = k det A. 


Solution: Determinant A is the same determinant as in the previous example. Its value, as determined 


previously, is —19. 


Now, expanding the determinant B across the first column, we get 
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ce@ oF Mrocve ft S|+@co[e 3) 
= 1(-60 + 10) — 2(20 — 5) + 3(10 — 15) = —95 = 5(-19). 
Property Number 4: If each element of the J column of a determinant A can be expressed as 
Qj = apy, + PQ; with j = 1,2---n, then, columnwise, 
A, = aP, + BQ). 


Now, expanding the determinant A across its £” column 


det(A) = det|A,A, --- A)_, aP; + BQ) Aj41 -** An| 


n n n 
= > (arn + Baj)Cu = a) PC +B) qj 
j j=l j=l 


j=l 
= adet|A, A, ++ Aj_y Py Aisi °° An| + B det| AyAg ++ Aj_1 Q) Ai41 °° Anl 
=a det(P) +f det(Q). 

Note, the determinants A, P and Q share all columns except the /*" column. 


Example: Two new determinants, P and Q, are created by splitting the third column of another determinant A, 


as shown: 


i oes a | 1-11 1-1 0 
A=|2 3 -2| withP=|2 3 -1}andQ=|2 3 -41|. 
3 1 -4 a en) gad) 2 


Show that det(A) = det(P) + det(Q). 


Solution: Determinant A is the determinant of previous few examples. Its value, as determined previously, is 


—19. 


Now, expanding the determinants P and Q across the first row, we get 


3 0- 2 - 


det(P) 1 =| + (-1)(-1)1*? ie * + (1)(-1)1*3 E | 


(ay(-1)"** | 


1(-6 +1) +1(-4+ 3) +1(2-9) = -5 -1-7 = -13, 
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det(@) = (D(-)"*"|$ + Coco | “a + occa | | 


= 1(-6+1)+1(-44+3)-0=-5-1=-6. 
Clearly, det(A) = det(P) + det(Q). 


Property Number 5: If a determinant B is obtained from determinant A by adding to the J” column of Aa scalar 


multiple of any other column of A, then det B = det A. 
Explicitly, 
B= |A,A, °° A; + AA, +++ Am °° An|: 
Using Property Number 4, 
det B = det|A,A,°++ A, ++: Am ++ An| + @ det|A, Az -*+ Am +? Am *** An|- 
Or, using Property Number 2, 
det B = det|A, A, -+ A, -** Am ++: A,| +0 = detA. 


Example: A new determinant B is created by adding first column to the third column of another determinant A 


as shown: 


1 -1 1 1 -1 2 
A=/2 3 -2|andB=|2 3 0 |. 
3 1 -4 3 1 -1 


Show that det B = detA. 


Solution: Determinant A is the same determinant as in the previous examples. Its value, as determined 


previously, is —19. 
Now, expanding the determinant B across the third column, we get 


det(B) = (2(-1)'* |? “+ —pe|2 {+ Cncn |, | 


3 
= 2(2-9)-0-1(3+ 2) =-14-5 =-19. 


Finally, we provide two examples in which we use properties of determinants to first simplify the determinant, 


then evaluate it. 
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Example: Using properties of determinants, find out the value of 


1 -1 1 
A=/2 3. -2I. 
3 1 -4 


Solution: We have already seen that the value of A is —19. Now we will use various properties of determinants 
to create as many zeros ina single row or ina single column as possible. We will do so for the first row of 


determinant A. 


First, we add column 2 to column 3 and use property number 5 to get 


1 -1 1 1 -1 O 
2 3 -2)= |2 3 1 
3 1 -4 3 1 -3 


Next, we add column 1 to column 2 and use property number 5 again to get 


1 -1 1 1 -1 0 1 0 0 
2 3 -2)= |]2 3 1/=|2 5 1 
3 1 -4 3 1 -3 3 4 -3 
Now, we expand across row 1 to get, 
1 -1 1 1 -1 O 1 0 0 5 4 
2 3 -2/=|2 3 1/=|2 5 1/=1[7 “I 
3 1 -4 3 1 -3 3 4 -3 
The determinant of order 2 is easy to evaluate as 
1-1 1 1 -1 O 1 0 5 4 
2 3 -2/=[2 3 1/=|2 5 1/=1[3 *|=-19 
3 1 -4 3 1 -3 3 4 3 


As seen in the above example, the trick of simplifying the evaluation of a determinant is to create as many zeros 
in a single row (or column) as possible and then use this row (or column) to evaluate the determinant. 

a 
d 


Example: Given that = 5, use the properties of general determinants to find the value of 


a 


S 


—2a 3d-g gta 
—2b 3e-h h+Di. 
—2c 3f-i ite 


D= 


Solution: The determinant to be evaluated is 
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—2a 3d-g gta 


—2c 3f—-i itc 


First, take —2 common out of column 1 to get 


a 3d-g gta 
D=(-2) 


c 3f-i ite 


Now, subtract column 1 from column 3 to get 


a 3d-g g 
D=(-2)|b 3e-h Al. 
c 3f-i i 
Adding column 3 to column 2 gives 
a 3d g 
D=(-2)|b 3e Al. 
c 3f i 
Taking a common factor of 3 out of column 2, we have 
a g 
D=(-2)(3)|b e Al. 
Cc i 


Finally, interchange rows and columns of the determinant, 


—2b 3e-h h+b}. 


b 3e-h h+b}. 


= (—6)(5) = —30. 


123 


PROBLEMS FOR CHAPTER 6 


1. Determine the values of the following two determinants 


5 2 3 —2 3 #5 
-1 3 4 and 1 2 4 
0 -2 3 0 3 -2 


2. Using the properties of determinants, first simplify and then determine the value of the following 


determinant, 


a—b a*—b? a—b3 
b-c b*-c? b?—c3 


c-a c?-a@ c-da3 


3. Without expanding the determinant, and using the properties of determinants only, show that 


bc ata bc ab ca 
b* ca b?|=Jab ca_ bel. 
c* c* ab ca bc ab 


4. Without expanding the determinant, and using the properties of determinants only, show that 


na, +b, na, +b, na, +b, ay b, c 
2 

nb, +e, nb, +c, nb, +c, =(n+ 1) (n* —n+4+1) a, b, C, 

ne, +a, nc, +a, nce, +a, a, b, c, 


5. Without expanding the determinant, and using the properties of determinants only, show that 


a~ (a+ 12 (a+ 2/2 l 0 0 
b2 (b+1)2 (b +2) = —4(a—b)(b—c)(c—a) 0 1 0 

0 0 1 
ce (ett% ~~ (e +27 


124 


Chapter 7: Matrices 


In this chapter we will learn about matrices and their properties. Just like determinants, matrices are also arrays 
of numbers or elements, but that is where the similarity between a determinant and a matrix ends. A 
determinant, by design, is a square array of numbers while a matrix, in general, is a rectangular array of 
numbers. With determinants, the elements are combined together to provide a single number, which is the 
value of the determinant. On the other hand, in the case of a matrix there is no concept of a single value of the 
matrix available by any combination of its elements. However, only for a square matrix, one can mention a 
corresponding determinant whose elements are the same as the elements of the square matrix, or one can 
loosely talk about determinant of a square matrix. As an application of matrices, in this chapter we will explore 


their use in solving n coupled inhomogeneous algebraic equations. 
7.1 A GENERAL MATRIX 


A general matrix is a rectangular array of numbers or elements. For example, 


G41 Ay2 "An 

421 422 ** Aan 
A= : : . : 

Qm1 GAm2 °° Amn 


is a matrix with m rows and n columns. It is common to say that A is an m X n matrix. The element aj; is located 
at the intersection of i*” row and j*” column of the matrix. If A = B, then all elements of matrix A are individually 
equal to all elements of matrix B; that is, a;; = b;; for all i and j. Clearly, if A is an m x n matrix, then so is B. 
Matrices consisting of only one row or one column are referred to as vectors. For example, an m X 1 matrixisa 


column vector of dimension m, and a 1 X n matrix is a row vector of dimension n. 


Various matrices combine (add, subtract or multiply) according to some well-defined rules. First, in order to add 
or subtract two matrices, they must be of the same size; that is, must have same number of rows and same 
number of columns. Furthermore, matrices add or subtract element-by-element. For example, if C = A+ B, 


then c;; = aj; + bj. Matrix addition is commutative as well as associative, that is, 


j 
A+B=B+AandA+(B+C) =(A+B)+C. 


Multiplication of two matrices, A and B, leads to a new matrix C, that is, C = AB. The element c;,, located at the 


jr 


intersection of i*” row and j*” column of matrix C, is given by 
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Cij = > aides: Eq.(7.1) 


k 


In other words, element c;; of C is obtained by multiplying, element-by-element, the i™ row of A with the j** 
column of B, and then adding all the products. Note that for the multiplication of two general rectangular 
matrices to have any meaning, the number of columns of A (namely, dummy index k in the sum in Eq. (7.1)) 
must be equal to the number of rows of B (again, the same dummy index k in the same sum). So, if A is an 

Mm X p matrix and B is a p X n matrix, then the product C is an m X n matrix, while the product BA is undefined. 


However, as long as matrix multiplication is allowed, it is both distributive and associative, that is, 
A(B + C) = AB+ AC, 
(AB)C = A(BC). 


The use of the word vector for matrices with only one row or one column is justified since multiplying a 1 x 3 
row matrix containing elements A,,A,, and A, with a 3 x 1 column matrix containing elements B,, B,, and B, is 


equivalent to taking the scalar or dot product of two vectors. Explicitly, 
(Az Ay Az)| By |= A,B, + A,B, + A,B, =AeB. 


Multiplication of a matrix A by a constant k simply multiplies each and every element of A by k; that is, 


kay, Kay. + kayn 
yee As EL Eq.(7.2) 
Kami kam2 + Kamn 


If the number of rows and the number of columns of a matrix are equal, say to n, then the matrix is a square 
matrix of order n. For two square matrices A and B of the same order, one can form the products AB as well as 


BA. However, in general, AB # BA. If AB = BA, then the matrices A and B are said to commute. 


Example: Consider two square matrices A and B of order 2, 


_(0 1 _(-1 0 
= G a a ( 0 v 
Check whether the products AB and BA are equal or not. 


Solution: The products AB and BA are 
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Clearly, in this case AB # BA. 


r) 
Example: Given two square matrices, A = G ) and B = e ), determine whether they commute or 


not. 
Solution: In this case, 
aB=(F ale a ae oe) 
“NB al\6 vy)” \By+aé5 BS+ay/’ 


5 We a= (ass py pst re) 


Since AB = BA, then A and B commute. 


A matrix with all of its elements as zero is called a null or a zero matrix and it is denoted by ©. We note in passing 


that, in general, if AB = O, it does not imply either A = O or B = O. As an example, consider two matrices, 


Their product is 


Note that the product AB is a zero matrix while neither A nor B is a zero matrix! 


7.2 PROPERTIES OF MATRICES 


From this point onwards, we will focus our attention only on square matrices. First, we define some special 


square matrices. A square matrix whose elements are all zero except the elements along the diagonal is called a 


diagonal matrix. For such a matrix A, a;; = O if i # j, and it looks like 


A441 0 aco (9) 
A=( ° ae) Eq.(7.3) 
0 0 hes 
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In particular, a square diagonal matrix which has only 1 as its diagonal element is a unit or identity matrix 1, 


1 0 0 
l= [eae eee Eq.(7.4) 
0 0 noe 1 


Thus, various elements of the unit matrix can be written in terms of a Kronecker delta as, a;; = 6;;. Multiplying 


any general matrix A, either on the left or on the right, by 1 gives back the matrix A. So, 


Transpose of a Matrix (A) 


A square matrix obtained by interchanging rows and columns of a matrix A is its transpose, A. (Another common 


notation for the transpose of a matrix which you may encounter is A’.) Thus, 


(A), => (A) ji or ai; => Aji. 


Obviously, (A) = A. There is an interesting property of transpose of a product of two general square matrices. 
The transpose of a product of two matrices is equal to the product of transposes of the two matrices in reverse 


order, that is, if C = AB then C = BA. Asa proof, note 


Ci = ys inde; = ». Ani Dix = by Biri = (BA) ,,. 
k 


k k 
Also, Cj; = ci; by definition of transpose. Thus Cj; = (BA) ,, or C= BA. 
Symmetric and Antisymmetric Matrices 


A matrix A for which A = A is called a symmetric matrix since for such a matrix aj; = a;;. On the other hand, a 


matrix A for which A = —A is called an antisymmetric matrix since for such a matrix aj; = —aj;. It implies that 


the diagonal elements a;; of an antisymmetric matrix are zero. Now, for any general square matrix A, A + Ais a 


symmetric matrix while A — Ais an antisymmetric matrix. It is so because 
ii + Aji = (A + A), 


Since, for any general matrix A, we can write 
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a ~ Wn _ 
A= (A +A) + (A A), 
it means that any general matrix can be written as a sum of a symmetric and an antisymmetric matrix. 
Inverse of a Matrix (A~*) 


lf AB = BA = 1 (unit matrix), then matrix B is the inverse of matrix A and matrix A is the inverse of matrix B. 
Thus B = A? and A = B‘?. Not every square matrix has an inverse; however, the inverse of a matrix, if it exists, 
is unique. As a proof of the uniqueness of the inverse of a matrix, let us suppose that there are two different 
matrices B and C which are inverse of the same matrix A. Then AB = 1 and CA = 1. Now consider the triple 


product of matrices, CAB. Using the associative nature of matrix multiplication, it can be rewritten as, 
CAB = (CA)B = C(AB) or (1)B = C(1) or B=C. 
Thus, the inverse of A is unique. 


There is an interesting property of the inverse of a product of two general square matrices. The inverse of a 
product of two matrices is equal to the product of inverses of the two matrices in reverse order; that is, if C = 


AB, then C7"! = B-1A‘1. As a proof, notice that 
CC = B-1A7AB = B-1B = land CC"! = ABB 1A1= AlA? = 1. 
Since the inverse of a matrix is unique, C~? is the true inverse of C. 
Adjoint of a Matrix (A*) 


In preparation for describing a technique for obtaining the inverse of a square matrix, we first define the adjoint 
of a square matrix. Note that for a square matrix A we may think of its determinant as the corresponding 
determinant whose elements are identical to the elements of A and whose value is det(A). Thus, if 

Ais ann X n square matrix, its adjoint matrix is obtained by first replacing each element a;; by the cofactor C;; 
of the corresponding element in the determinant det(A), with appropriate sign, and then transposing the 


matrix. Specifically, adjoint of A is 


Cy C, 1 Chi 
A“ = C2 C22 Cho 
Cin Con ne Cnn 


where C;; is the cofactor of a;;. From its construction, we infer (A); = C);. Using Eq. (6.1), then 
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(AA*);; = > AieAD es = > Aix Cjx = 5;; det (A). 
K 


k 


or, 
AA® = det(A) 1, 
where 1 is the unit matrix. 


Similarly, using Eq. (6.1) again, 
(ACA)y = ) (AD xA)ey =) Ceitay = 5iy det (A). 
k Kk 


Thus, 
AA® = A*A = det(A)1. 


This can be rewritten as 


(sercay)*= 4 (aera) = 
so that 


A = as Eq.(7.5 


Note that if det(A) = 0 then A“? is undefined. In other words, the condition for A~? to exist is that det (A) # 0. 
A matrix A for which det (A) ¥# 0 is referred to as a nonsingular matrix while a matrix for which det(A) = Oisa 


singular matrix. Thus, the inverse of a singular matrix does not exist. 


Example: Find the inverse of the matrix 


using the adjoint method. 


Solution: First, we note that det A = +1(—9) + 2(4) = —1 ¥ 0, so that the matrix A is nonsingular and its 


inverse exists. 
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The cofactors of various matrix elements are 
C14 = —9, Cio = —6, Cy, = +4, 
Cy, = +4,C,, = +3, C3, = —2, 
C3, = +2, Cap = $1,053, =—1. 


The adjoint matrix is 


Finally, the inverse of A is 


i 9 -4 -2 
At =e -3 -1). 
det(A) (s g 7 


Just to verify that our inverse is correct, we multiply the original matrix A by its inverse matrix A7+ and make 


sure that we get the identity matrix, 1. 


9 -4 -2\/1 0 2 1 0 0 
ATA = ( 6 -—3 -1) (: -1 :) = (: 1 | =1. 
—-4 2 1 0 2 3 0 0 1 


Example: Given the matrix 


find A-! using the adjoint method. 


Solution: The determinant of this matrix is det A = +1(6) — 2(1) = 4 # 0, hence the matrix is nonsingular. The 


cofactors of various elements a;; are 
Cy, = +6,Cyy = —4,Cy3 = +2, 
Co, = —1,Cy2 = +2,0C23 = —1, 
C3, = —9,C32 = +6,C33 = —1. 


The adjoint matrix is 
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and the inverse matrix is 
Ae ( 6 -1 ) 
At =——_=-|[-4 2 6 |. 
det(A) 4 > 1-1 


7.3 SOLVING INHOMOGENEOUS ALGEBRAIC EQUATIONS 


The idea of matrix inversion is very useful in solving a set of algebraic equations. Consider the following set of n 


algebraic equations containing n unknown variables x, X,°+*, Xn, 
Ay 4Xy + AyQXp Fo + AynXy = Vip 


Az1X1 + Az2X2 +++ FAgnXn = Vo, 


Ani X1 + an2x2 H+ Ann Xn = Yn: 


Here all the coefficients a;; as well as the numbers on the right-hand sides of these equations, namely y,,, are 
known. If all the numbers y,, are zero, then these algebraic equations are called homogeneous algebraic 
equations. On the other hand, if all or some of the numbers y,, are nonzero, then these algebraic equations are 


called inhomogeneous algebraic equations. The equations can be rewritten in a matrix form as 


Qi, Ay2 Qin xy V1 
421 A22 a2n x2 \_ | Y2 
: = , 
Qn1 Qn2 Ann Xn Yn 
or 
AX = Y, Eq. (7.6a) 


where X and Y are column matrices (or vectors), Y is known and X is unknown. Then, using the idea of matrix 


inversion, 


a 


X=A1lY = ——_Y. 
det(A) 


Eq.(7.6b) 
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Example: Solve the following three inhomogeneous algebraic equations of three variables, x, y, and z using 


the inverse of a matrix. 
x+2y+3z=4, 
2x+3y =8, 
y+2z=12. 


Solution: Rewriting these equations in a matrix form, we get 


1 2 3\ /x 4 
ax=(2 3 o)(r)=()=» 
0 1 2/ \z 12 


The inverse of this matrix A was determined in the previous example. Using it, we get 


x 4\ 4/6 -1 -9\/4 
()-a(8)-H{-+ 2 48)(8) 
z 12 2 -1 -1/\12 
6 -1 -9\/1\ /-23 
-(-+ 2 48)(2)-(18) 

2 -1 -1/\3 —3 


Thus, x = —23,y = 18 and z = —3. 


In case of homogenous algebraic equations, Y = 0. Then, if det(A\) # 0, we get the solution X = 0, which is 
called the trivial solution. A nontrivial solution of homogenous algebraic equations is available only 


when det(A) = 0. 
7.4 SPECIAL MATRICES 
Now, we introduce some well-known and useful matrices. An orthogonal matrix A is the one that satisfies 
AA = AA=1. 


Clearly, for the orthogonal matrix, 


As an example of an orthogonal matrix, consider 
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= eu aa 
—sina cosa/ 


For this matrix 


Aa(O88 Fone 
sina cosa 


and 


Ana ( S22 puna) ie Pas 


—sina cosa/\sina cosa 


as well as 


AA= = 


eae eee Oo) 
sina cosa —sina cosa 


So, we conclude that A = A~! and A is an orthogonal matrix. 


If the elements of a matrix consist of some complex numbers, then we can obtain the 
complex conjugate matrix, A*, by simply replacing each element of matrix A by its complex conjugate. Thus, if 
(A) ij = a; 


j then (A*);; = aj;. As an example, consider 


Its complex conjugate matrix is 


At = ( 2 a 


2+1 3 


Now, we will introduce Hermitian and unitary matrices. First, we define the Hermitian adjoint of a general 
matrix A. It is denoted by the symbol f (called dagger). The Hermitian adjoint is obtained by first taking the 


complex conjugate of the matrix, followed by taking the transpose of the matrix. Thus, 


At = A* 


is the definition of the Hermitian adjoint of A. Now, a matrix A is a Hermitian matrix if its Hermitian adjoint is 


equal to the matrix itself. In other words, if 
At =A, 


then A is a Hermitian matrix. 
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Next, a matrix A is a unitary matrix if its Hermitian adjoint is equal to the inverse, A~+, of the matrix. In other 


words, if At = A-!, or if 
AAt = AtA=1, 
then A is a unitary matrix. 


As an example, consider the matrix 


1 a b —ic 
Aa eee a) 


with a, b,and c real. One can even visually see that At = A for this matrix, so that it is a Hermitian matrix. 


Furthermore, one can easily check that this matrix satisfies 
AAt = AtTA=1, 

so that this matrix is also a unitary matrix. 
7.5 CHARACTERISTIC OR SECULAR EQUATION OF A MATRIX 

If Ais ann X n matrix and A is a scalar parameter, then 

K=A-Al 
is called the characteristic matrix of A. The equation 
det(K) = det(A — Al) = 0 


is called the characteristic or secular equation of A. If Ais ann X n matrix, then the secular equation is of the 


form 
fA =14+aAt+a A? 4+--+a,4" =0 


where the coefficients a;(i = 1, ...n) depend on the elements of A. The n roots of this algebraic equation 


A,,A5, «+++,A,) are called the characteristic roots of the matrix A. 
1972 nN 


Eigenvalues/Eigenvectors of Matrices 


Given an X n matrix A, there exist certain column matrices (or vectors) X such that AX is a simple multiple of X. 


In other words, 
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AX = AX Eq. (7.7) 


where J is a number (real or complex). Then J is said to be an eigenvalue of A belonging to the eigenvector X. 


Here X is a column matrix with n rows, 


Thus, Eq. (7.7) gives 


Or, 


n 
IG = Gi; )x; = 0. 
j=1 


These are n homogenous algebraic equations, for i = 1, 2, ...n, in variables x1, X>,...,X,. A nontrivial solution 


exists for these equations only if 


det(A — All) = 0. 


This, of course, is the secular equation of matrix A. The characteristic roots of A are, then, the eigenvalues of A, 


and the corresponding vectors X are the eigenvectors. We will briefly note that eigenvalues and eigenvectors 


quite useful in quantum physics and this fact will be mentioned again in Chapter 13. 


Example: Find the eigenvalues and eigenvectors of the matrix 


03 4 
A= (: 0 0} 
4 0 0 


Solution: The characteristic equation is det(A — All) = 0, or 
A 3 4 
det} 3 -A O ]=0. 
4 0 -a 


On expanding the determinant, we get 


are 
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~43 +92 +164 =0 or AA+5)(A—5) =0. 


A=0,+5,—5. 


These are the three eigenvalues of matrix A. Next, we determine the eigenvectors one-by-one. For 2 = 0, AX = 


AX gives, 


3X2 + 4x3 = 0,3x, = 0,4x, = 0. 


Thus, within an arbitrary constant, the eigenvector is 


A normalized eigenvector is analogous to a unit vector since they both have magnitudes of 1. To normalize any 


eigenvector, we simply divide it by the magnitude of the eigenvector. So, for A = 0, the magnitude of the 


eigenvector is ,/0 + 47 + (—3)? = 5, and, therefore, the normalized eigenvector is 


For = +5, AX = AX gives 


—5x, + 3X2 + 4x3 = 0 


3 
+3x, — 5x, = Oorx, =e 
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4 
+4x, — 5x3 = Oorx3; = 5 


Again, within an arbitrary constant, the eigenvector is 


5 
F . , 1 
and the normalized eigenvector is (3) 


Finally, fora = —5, AX = AX gives 


+5x1, + 3X2 + 4x3 = 0 
3 
+3x, + 5x, = Oor x, Se x4 


4 
+4x, + 5x3 = Oorx3 = 5%: 


So, the eigenvector is, within an arbitrary constant, 


5 
‘ r 1 
and the normalized eigenvector is Z(-3} 
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PROBLEMS FOR CHAPTER 7 


1. Given the following two matrices A and B, determine AB, BA, A?, and B? if they exist. If a product does not 


exist, make a note of it. 


2. Given the three matrices, 


0 1 0 0 0 0 0 0 1 
a=(0 0 o), a= (0 0 i) c= (1 0 o), 
0 0 0 0 0 0 0 0 0 
find all possible products of pairs of matrices (either A and B, or B and C, or C and A ) to determine which 


pair, or pairs, of matrices commute. 


3. For the following matrix A, determine A“, 


Explicitly verify that AA"? = A*A= 1. 
4. Determine the eigenvalues and the normalized eigenfunctions of matrix A of problem 3. 
5. Solve the following inhomogeneous algebraic equations by finding the inverse of the matrix of coefficients, 
x-ytz=4 
ax +y-—z= —-1 
3x +2y+2z=5. 
6. In the theory of spin - in quantum physics, we encounter three 2 x 2 matrices (Pauli spin matrices), 
a=G 9 a DeanG 4) 
Prove the following properties of these matrices: 
(a). 0,0, =i 24 Gan OG, for l =m. 


(hier =a) =o7=1= ¢ ah where 1 is the unit, or identity, matrix of size 2 x 2. 


0 1 
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7. Determine the eigenvalues and the normalized eigenvectors of the three Pauli spin matrices of problem 6, 


separately. 
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Chapter 8: Vector Analysis 


In biomedical physics we encounter several physical quantities. Some of these quantities are characterized by 
their magnitude only, such as temperature of a sick patient, density of a blood sample, pressure in the lung, etc. 
Such quantities are called scalars and they are treated as ordinary algebraic numbers. On the other hand, there 
are physical quantities which are characterized by both magnitude and direction, such as forces on human spine 
during heavy lifting, velocity of blood flowing in veins, electric and magnetic fields in medical equipment, etc. 


Such quantities are called vectors. In this chapter we will learn about mathematical properties of vectors. 
8.1 VECTORS AND THEIR ADDITION/SUBTRACTION 


Vector quantities are represented by directed lines whose lengths are proportional to the magnitude of the 
physical quantity and whose direction is the direction of the physical quantity. These directed lines look like an 
arrow with one end as the head of the arrow and the other end as the tail of the arrow, as shown in Figure 8.1. 


Vectors are shown as bold-faced letters like A and their magnitude is shown as 


head 


tail 


Figure 8.1. A vector, A, with its head and its tail. 
ordinary letters like A. We can move a vector, or the directed line representing the vector, around in space and 
the vector will remain unchanged as long as both the length of the directed line as well as its direction are kept 


unchanged. 


An alternate way of representing a vector is through its components. If the Cartesian coordinates of the head 


and of the tail of a vector A are (Xp, Yn, Zp) and (%;, Vz, Zz), respectively, then, the components of A are 
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The length of the vector is 


A= [a2 + a2 + Az]. Eq. (8.14) 


It is easier to see that if the origin of the coordinate system is chosen at the tail of the vector A, then 


A,, Ay, and A, are merely the projections of A onto the three coordinate axes. 


Addition and Subtraction of Vectors 


Vectors do not add or subtract or multiply like ordinary numbers. First, we will learn the rules for adding and 
subtracting vectors, which lead to new vectors. Next, we will learn the rules for multiplying vectors, which can 
lead to either a scalar (scalar product) or a vector (vector product). The vectors can be added using a geometrical 
method, or head-to-tail method, in which we connect the head of first vector, A, with the tail of the second 
vector, B. A new vector X whose head is the free unconnected head of B and whose tail is the free unconnected 


tail of A is the sum of vectors A and B, as shown in the Figure 8.2 


X=A+B. 


Figure 8.2. Vector X is the sum of vectors A and B. 


Figure 8.3. Commutative nature of vector addition. 


By completing the parallelogram, as in Figure 8.3, it is seen that 


X=B+A 
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as well. Thus, vector addition is commutative. Addition of more than two vectors proceeds in the same manner. 


IfZ=A+B+C, then from the Figure 8.4 
X=A+B, 
Y=B+C, 
and 


Z=X+C=A+Y, 


B 


Z 


Figure 8.4. Sum of more than two vectors. 


That is, 
Z=(A+B)+C=A+(B+C). 


Thus, vector addition is associative as well. Now, let us talk about subtraction of one vector from another. Figure 
8.5 shows two vectors, A and B, of equal length but with opposite directions. According to head-to-tail method of 


adding vectors, A+ B=0 orB=—A. 


In other words, —A (or B) is a vector which has same magnitude as A but its direction is opposite to that of A. 
Subtraction of one vector from another can now be handled exactly in the same manner as vector addition 


except the sign (direction) of one of the vectors is reversed. For example, 


A—B =A +(-B). 
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Figure 8.5. Two vectors A and B of equal length, but opposite directions. 


Figure 8.6. Subtraction of two vectors. 


Also, graphically the “other” diagonal of the parallelogram in Figure 8.3 gives A — B as shown in Figure 8.6. 


An alternative way of adding vectors is by using the components of the vector. For this purpose, we choose a 
right-handed Cartesian coordinate system. The meaning of a right-handed coordinate system will be clearer after 
Eq. (8.6). For this coordinate system, we introduce unit vectors e,, ey, and e,. These three vectors are each of 
unit length and are directed along the x, y, and z axes, respectively. We note in passing that several other 
authors use the notation i, j, and k or &, ¥, and Z for these Cartesian unit vectors. In our notation, then A,e, is a 
vector of length A, along the x-axis, Aye, is a vector of length A,, along the y-axis, and A,e, is a vector of length 


A, along the z-axis. Using vector addition by head-to-tail method, as seen in Figure 8.7, 


A= A,e, + Aye, + A,e,. Eq.(8.1b) 


Figure 8.7. Components of a vector A. 
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Now, two vectors A and B can be added simply by adding their components, that is, 


A+B= (e,Ax + eva, + e,A,) ae (e,B, oF eB, ar e,B,) 


e,(Ay + By) + ey(Ay + By) + e,(A, + B,). Eq. (8.2a) 


Subtraction of one vector from another can now be handled exactly in the same manner as vector addition 


except sign (direction) of one of the vectors is reversed. For example, 


A-—B=A-+4 (-B) = e,(A, — B,) + e,(A, — B,) + e,(A, — B;). Eq.(8.2b) 


It follows from the discussion of addition and subtraction of vectors that if two vectors are equal to each other, 


then individually the x, y, and z components of those two vectors are also equal. 
8.2 MULTIPLICATION OF VECTORS: SCALAR PRODUCT 


Now, we talk about multiplying two vectors. There are two ways to multiply vectors. One way leads to a scalar 
and the other to a vector. The scalar (or inner or dot) product of two vectors A and B is a scalar number of 
magnitude AB cos 0, where A and B are the magnitudes of the two vectors and @ is the angle between the two 
vectors. This angle is obtained by joining the tails of both vectors as in Figure 8.8. The scalar product of two 
vectors is indicated by placing a large dot between the two vectors, AeB = AB cos 0 = BeA. Note that the scalar 


product of two vectors is commutative. The scalar product is also distributive, that is, 


A 


Figure 8.8. Scalar product of two vectors A and B. 


Ae(B + C) = AeB + AeC. Also, since unit vectors e,, ey, e, are mutually orthogonal, it follows that. 
e, ee, = 1, eyee,=1, e-ee,=1 


and 
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e, ee, =eyee, = 0,e,°e, =e, °e, =0,e,ee, =e, ee, = 0. 
In particular, note 
AeB = (e,A, + eA, + €,4,) * (€,B, + eyB, + eB) 
= +A,By(€, * €,) + AyBy(0) + AzB,(0) 
+A,By(0) + AyBy(ey * ey) + A,B, (0) 
+A,,B,(0) + AyB,(0) + A,B,(e, © e,) 
or 


AeB=A,B, + AyBy + A,B,. Eq.(8.3) 


Also, 
AeA= Ax + Ay + AZ 
so that VAeA gives the magnitude of vector A. 


Now we introduce an alternate, but useful, new notation. We will use e;, ez, e3 for the three orthogonal unit 


vectors €,, ey, and e,, and A,, A>, and A; for the three components Ay, Ay, and A, of a vector A. In this notation 


it follows that 


ee; = On Eq. (8.4) 


Using this identity, we can write the scalar product of two vectors in terms of a Kronecker delta as 


AeB -yie Aj -> 68 => >. 6 A,B; => va B;. 
i j ij i 


Finally, the law of cosines in trigonometry, which relates the lengths of three sides of a triangle, can be 
easily obtained using scalar product of two vectors. Three vectors A, B, and C, form the sides of a triangle as 
shown in the Figure 8.9. The angle 6 between the two vectors A and B is obtained by joining the tails of both 


vectors. Using head-to-tail vector addition, 
C=A+B, 


or 
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C2 =CeC=(A+B)°(A+B)=AeA + BeB +2AeB = A? + B* +2 ABcos 0 
or 


C? = A? + B? —2 ABcosd Eq.(8.5) 


Figure 8.9. Law of cosines. 


where @ is the angle inside the triangle opposite to side C, that is, the angle supplementary to @. This is the law 


of cosines in trigonometry, and it is consistent with the Pythagorean theorem for a right-angled triangle (when 


= 1/2). 


8.3 MULTIPLICATION OF VECTORS: VECTOR PRODUCT 


Now, we talk about the second way of multiplying two vectors which leads to a vector. The vector (or cross) 
product of two vectors is indicated by placing a large cross between the two vectors A and B. It is defined to be a 


new vector C, 


C=A xB, 


such that the magnitude of Cis C = ABsin@ and direction of C is perpendicular to the plane containing 

A and Bin such a sense that A, B, and C form a right-handed system. In order to understand the direction of C 
better, let us imagine that the plane containing A and B is a big wall. We place a screwdriver, with a screw, 
against the wall such that the screwdriver can be rotated either clockwise or counterclockwise and the screw 
itself will move either into the wall or out of the wall. In the Figure 8.10 (a), the screwdriver is rotated 
counterclockwise, as shown by the arrow on angle @, from A to B, and the direction of A xX B, which is along the 
direction of motion of the screw itself, is coming out of the wall, indicated by symbol ©. In Figure 8.10 (b), the 
screwdriver is rotated clockwise along the arrow shown on angle @, from B to A, and the direction of B x A, or 


the direction of motion of the screw itself, is going into the wall, indicated by symbol ®). This is referred to as the 
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right-hand rule. With this definition, the vector B x Ahas the same magnitude as vector A x B; however, 


direction of B x Ais opposite to that of A x B. Thus, 
A xXB=-B XA, 
that is, the cross product is anticommuting. In the case of the three unit vectors e,, ez, and e3, 
e, Xe, =0, e, X e, = 0, es X e3 = 0, 
€, Xe, =e3 = —€2 Xey,e2 X €3 = ey = —€3 Xp, e3; X ey =e, = —e€, Xz. 


In a more compact form, we can write all of these nine relationships as 


3 
e,xXe,= Gijxn Ck: Eq. (8.6) 
J », J 
AxB BxA 
© ® 
B B 
@ @e 
\ 
A A 
(a) (b) 


Figure 8.10. Vector product of two vectors A and B. 


The three unit vectors, satisfying Eq. (8.6), describe a right-handed Cartesian coordinate system. Using these 


relations, the components of C = A X Bcan be written as 
C=A XB= (e,A, + e2A, + €3A3) X (€,B, + €2B, + €3B3) 
= e3A,B, — e2A,B3 — €3A,B, + €1A,B3 + €2A3B, — €1A3B, 
= e;(A,B3 — A3Bz) + €2(A3B, — A,B3) + €3(A,Bz — A2B,). 
Thus, 
C, = A,B; — A3B, 


C, = A3B, — A,B; 
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C; = A,B, — A,B, 
or 
C; = Aj By = A,B; 


with i, j, and K all different and in cyclic permutation. In terms of Levi-Civita symbol, we can write the 


components of C as 


C; = >» €1jxAjBx ; 


dK 


and the vector C as 


c=) ec, 
i 


or 


AX B=) ce€sAjBr Eq. (8.7) 
ij,k 


We note from this suggestive form that the vector product can be written in the form of a determinant as 


€&, &2 &3 
C=AXB=]|A, Az As]. Eq. (8.8) 
B, Bz, Bs 


The cross product of two vectors has a very nice geometrical interpretation. Consider a parallelogram, in a plane, 
whose sides are made up by vectors A and B, as shown in Figure 8.11. Note that the area of this parallelogram is 
AB sin @, which is the magnitude of A x B. However, the direction of A x B is perpendicular to the plane of the 
parallelogram. Thus, in general, the area of an element such as the parallelogram defined above can be treated 
as a vector with its direction perpendicular to the plane containing that element. Note that the direction of area 
vector, either into or out of the plane, is related to the sense of traversal of the periphery of the area by the 
right-hand rule. In Figure 8.11, the direction of the area element A x B, of magnitude ABsin @, is obtained by 
traversing the periphery by going first over (solid line) vector A and then over (solid line) vector B, that is, 
traversing the periphery indicated by the thick counterclockwise arrow corresponding to direction coming out of 
the plane of parallelogram according to the right-hand rule. Similarly, the direction of the area element B x A, 
also of same magnitude ABsin @, is obtained by traversing the periphery of the parallelogram by going first over 


(dashed line) vector B and then over (dashed line) vector A, that is, traversing the periphery indicated by the 
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thick clockwise arrow corresponding to direction going into the plane of the parallelogram according to the right- 


hand rule. 


a 


AXxB 
© 


Figure 8.11. An area element as a vector. 


8.4 TRIPLE SCALAR PRODUCT 


Now let us explore how we can multiply three vectors A, B, and C to obtain either a scalar or a vector. The 


product Ae(B x C) is a scalar and is referred to as the triple scalar product. In terms of components, 
Ae(B XC) =A,(B XC), + A,(B XC), + A3(B XC), 
= A, (B2C3 — B3C,) + A2(B3C, — By, C3) + A3(B,C, — B2C;). 
The expression on the right-hand side can be rewritten in two alternate forms: 
first form = B,(C,A, — C3A,) + B2(C3A, — C,A3) + B3(C, Az — C,A,) = Be(C XA), 
or 


second form = C,(A,B; — A3B,) + C,(A3B, — A,B3) + C3(A, Bz — A,B,) = Ce(A XB). 


Thus, the triple scalar product can be written as, 


Ae(B x C) = Be(C x A) = Co(A x B) = —Ac(C X B) = —Be(A X C) = —Ce(B x A). Eq. (8.9a) 


In other words, dot and cross can be interchanged in a triple scalar product. The triple scalar product can also be 


written in the form of a determinant: 
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A, Az A3 
Ae(BXC)=]B, Bz Bl) , Eq. (8.9b) 
C, & C3 
or, in terms of Levi-Civita symbol, 
Ae(B XC) = > ee Ge Eq. (8.9c) 
ik 


The triple scalar product has a nice geometrical interpretation. Consider the parallelepiped made up by vectors 
A,B, and C as shown in Figure 8.12. The area of the base of the parallelepiped is the parallelogram A xX B. The 


direction of this area element is along n, the normal to the base. Then, 


(A x B)eC = |A x B|(neC) 


= (area of the base times height) = volume of the parallelepiped. 


C cos o 


Figure 8.12. A parallelepiped made up by vectors A, B,and C. 


8.5 TRIPLE VECTOR PRODUCT 


Now we explore the possibility of obtaining a vector by multiplying three vectors A, B, and C. In the triple vector 


product A x (B X C), the order of multiplying vectors is important. For example, 


e, X (e, X en) # (€, Xe) X ey, 


since the left-hand side is zero because the vector product of a vector by itself is zero, but the right-hand side is 


nonzero, equal to e3 X e, = —e,. In general, the triple vector product can be written as 
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AX (B x Cc) =AXV= > ened Eq. (8.10a) 
ij,k 


where 


V=BxC and Vv; = EBC Eq. (8.10) 


Lm 


On substituting for V; from Eq. (8.10b) into Eq. (8.10a), we get 


Ax(B xC)= > €ijx@iA; > EximBiCn- 
lm 


ijk 


To carry out the sum over the repeated index k, we use the epsilon-delta identity (see Appendix C for proof), 


a EijkEkim =>. Exij Ekim = 6i15jm a SimOj 


k k 


so that the triple vector product looks like: 


Ax(B xC)= a €A;B,Cn [Si Sjm = Simi]: 


ij lm 
Because of the Kronecker deltas, the sum over the repeated dummy indices / and m can be carried out easily, 


ij i j j 


= B(AeC ) — C(AcB ). Eq. (8.11) 


The order of vectors appearing on the right-hand side is remembered by using the mnemonic BAC-CAB. Another 


form of triple vector product is 


(A x B) xX C= —C x (AX B) = —A(BeC) + B(CeA), Eq. (8.12) 


after using the BAC-CAB rule in the last step. Now, the order of vectors in the two forms of the triple vector 


product, namely, Eqs. (8.11) and (8.12), can be determined by: 


Vector product of three vectors = middle vector times scalar product of the remaining two vectors minus (—) 


other vector in the parenthesis times scalar product of the remaining two vectors. 
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PROBLEMS FOR CHAPTER 8 


1. Using the concept of scalar, or dot, product of two vectors, determine the angle between vectors A = e, + 


Ze, + 2e, and B =2e, — 3e, + 6e,. 


2. Prove that the vectors A = 3e, + ey — 2e,,B=-e, +3e,+4e,, andC =4e, - 2 e, -6 e, can form the sides 


of a triangle. Find the lengths of the medians of this triangle. 


3. Using the concept of vector, or cross, product of two vectors, determine the area of a triangle with vertices at 
points A(1,2,3), B(2,4,5), and C(1,-8,0). In our notation here, P(x, y, z) refers to the Cartesian coordinates of the 


point P. 


4. Determine a unit vector perpendicular to the plane containing vectors A = 2 e, — 6e, — 3 e, andB = 4e, + 


3 ey — e,. 


5. The eight corners of a parallelepiped are labeled as C1, C2, C3, Ca, Cs, Ce, C7, and Cg. The Cartesian (x, y, Z) 
coordinates of the corners are C; (5, 2, 0), C2 (2, 5, 0), C3 (4, 5, 0), Ca (7, 2, 0), Cs (6, 3, 3), Ce (3, 6, 3), C7 (5, 6, 3), 
and Cg (8, 3, 3). 


(a) Determine the three vectors A, B, and C that define this parallelepiped. 
(b) Determine the volume of the parallelepiped. 


[Hint: To visualize this parallelepiped, plot and label the first four and the last four corner points in two x — 


y planes separately. The two x — y planes are z = O plane andz = 3 plane.] 

6. Prove the following identity for vector triple product: 
Ax (Bx C) + Bx (CX A) + Cx (AX 8B) = 0. 
7. Prove the following two identities: 
(A x B) ¢ (CX D) = (Ae C) (Be D) — (A*D) (Be C), 
(A x B) xX (CX D) = [(A x B) e D]C — [(A x B) e C]D = [(C x D) * A] B — [(C XD) @ BJA. 
8. If A and B are any two vectors, then show that 
(A x B)e(A xB) = A7B? — (AeB)?. 


9. Arhombus is a parallelogram with all four sides of equal length. Using the concept of scalar (or dot) product 


of vectors, show that the two diagonals of a rhombus are always perpendicular to each other. 


153 


10. Biomedical Physics Application. The “life”, as we know it, is based on the chemistry of carbon-containing 
molecules. An understanding of the structure of these molecules is essential to understanding life. One of the 
simplest hydrocarbon molecules is methane (CHa), in which four hydrogen atoms are placed symmetrically at 
the vertices of a tetrahedron around a single carbon atom. Alternatively, the four hydrogen atoms are located at 
the four corners of a cube, as shown in the figure below, with the carbon atom at the center of the cube. The 
bond angle is the angle between the lines that join the carbon atom to two of the hydrogen atoms. Using your 


knowledge of the scalar product of vectors, show that the bond angle in methane molecule is 109.5°. 


[Hint: Take the origin of the coordinate system to be at the center of the cube (at carbon atom) of side 


length 2a and determine the coordinates of each hydrogen atom.] 


11. Consider an arbitrary triangle whose three sides are made up of three vectors A, B and C, then according to 
vector addition, A+ B + C=0. The three medians of this triangle, namely My, Mg, and Mc, are also vectors. The 
median Mg has its tail at the vertex opposite A and its head at the midpoint of A. There is similar construction 


for other two medians M, and Mc. Using vector addition, show that 
1 
M, = — 3 (B—C), 


1 
M, =—3(C—A), 


1 
Mc = —5(A-B). 


12. The three vertices of a triangle in the x — y plane have Cartesian coordinates (xj, y;), (x2, yz), and (x3, y3). 
Using the facts that three medians of a triangle intersect each other at a common point, called a centroid, and at 
this point each median is trisected, show, using vector addition, that the Cartesian coordinates of the centroid 


are 
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Xy t+ Xz +X3 Wy + V2 “#2 


(Xe Ye) = ( 3 , 3 


13. The three vertices of a triangle in the x — y plane have Cartesian coordinates (x,, y), (Xz, ¥2), and (x3, y3). 


Using the cross product of two vectors, show that the area of this triangle is given by the following determinant: 


1 1 1 1 
area = 2 xX, X2 X3}. 
V1 y2 3 
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Chapter 9: Curvilinear Coordinates and Multiple Integrals 


In this chapter, we will introduce two important curvilinear coordinate systems. These are the spherical 
coordinate system and the cylindrical coordinate system. They are very useful in carrying out multiple integrals 


encountered in problems with spherical and cylindrical symmetries. 
9.1 CARTESIAN COORDINATES 


First, we review the details of the well-known and more common rectangular Cartesian coordinate system. In 
this system, a point has three coordinates (x, y, z) which are the distances perpendicular to three mutually 
orthogonal surfaces. All three coordinates (x, y, z), vary independently from —oco to +00. Then we define some 


families of surfaces and families of curves for Cartesian coordinate system. A surface on which the value of the x 


Figure 9.1. A point P in space with Cartesian coordinates x, y, z. 


coordinate is constant, while the values of the y and z coordinates can vary, is called the x = constant surface. 
This surface is a plane. Similarly, y = constant and z = constant surfaces are planes, as shown in Figure 9.1. A 
curve along which the x coordinate varies, while the y and z coordinates are constant and fixed is called the 
x-curve (or, x-axis). This curve is a straight line. Similarly, the y-curve and z- curve are straight lines. The unit 
vectors ex, ey, and e, are tangentially along the x-curve, y-curve, and z-curve, respectively. Consider a point, P, 


in space with Cartesian coordinates (x, y, z). The vector, shown in Figure 9.1, 
r=xe,+ye,+ze,, 


locates the position of point P with respect to the origin. A length element dl in this coordinate system is 
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dl=axe, +dye,+ dz e,, Eq.(9.1a) 


and the distance between two neighboring points is 


dl =  [(dx)? + (dy)? + (dz)?]. Eq. (9.1b) 


A volume element in Cartesian coordinate system is 


dV = dxdydz. Eq.(9.1c) 


An area element in this system is 
dS = dxdy e, + dydz e, + dzdx ey. Eq.(9.1d) 


9.2 SPHERICAL COORDINATES 


Again, consider the same point, P, in space located at a position given by the vector r as shown in Figure 9.2. The 
spherical coordinates (1, 6, @) of point P are defined as follows. The coordinate r is the length of vector r and its 
range is 0 <r < oo. The coordinate @ is the angle between vector r and the z-axis and its range isO <6 <7. It 
is similar to the angle of co-latitude in astronomy. In order to describe the third spherical coordinate, @, we need 
to focus our attention on two planes, the y = 0 plane (or the x- z plane) and the plane containing the vector r 
and the z-axis. Then, @ is the angle between these two planes. The coordinate ¢ is similar to the angle of 


longitude in astronomy and its range is 0 < @ < 27. The surface r = constant is a sphere of radius r. The surface 


Figure 9.2. The point P of Figure 9.1 shown with its spherical coordinates r, 0, p. 
6 =constant is the curved surface of a cone having vertex at the origin. This cone becomes the z = 0 plane if 
6 = 1/2, becomes the +z-axis if 9 = 0, and becomes the —z-axis if 9 = m. The surface @ = constant is a semi- 
infinite plane. A curve along which the r coordinate varies, while the @ and @ coordinates are constant and fixed, 


is called the r- curve. This curve is a straight line. Similarly, the @- curve is a semicircle and the @- curve is a full 
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circle. The unit vectors e,, €g, and €y are tangentially along the r-curve, 6-curve, and @-curve, respectively. 
The relationships between spherical coordinates (1, 8, @) and the Cartesian coordinates (x, y, z) are 
x=rsinOcos¢ 
y=rsin@sing Eq.(9.2a) 
zZ=rcos8, 


or the inverse relationships, 


Il 


(x? + y? + 27)'/2 


Zz 
0 = arccos a Eq. (9.2b) 
od = arctan(y/x) 
Thus, 
dx = sin@ cosg dr + r cos@ cos dé —r siné sing dd, 
dy = sin@ sing dr +r cos@ sing dé + r sin@ cosd dd, Eq. (9.3) 


dz =cos@ dr —rsiné@ dé. 


The distance between two neighboring points is 
(dl)* = (dx)* + (dy)? + (dz)’, 
or, on substituting for dx, dy, and dz in terms of dr, d@, and dd from Eq. (9.3), 


(dl)? = (dr)? + (r d@)? + (r sind dg)’. Eq.(9.4a) 


Thus, we note that the roles of length elements dx, dy, and dz of Cartesian coordinates are taken over, in 
spherical coordinates, by length elements dr, rd, and r sin 6 d@. Thus, a volume element in spherical 


coordinates is 


dV = r*sin@ dr dé dd, Eq.(9.4b) 


and an area element is 


dS = rdrdé eg + r* sin 6 déd¢ e, +r sin 6 drdd eg. Eq. (9.4c) 
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9.3 CYLINDRICAL COORDINATES 


Once again, consider the same point, P, in space with Cartesian coordinates (x, y, z) and spherical coordinates 
(r, 0, p). The cylindrical coordinates (p, d, z) of this point are shown in Figure 9.3 and are defined as follows. The 
cylindrical z coordinate is same as the Cartesian z coordinate. Thus, the range of the z coordinate is -oo < z < 
oo. The cylindrical @ coordinate is same as the spherical @ coordinate. So, the range of this coordinate isO < @ < 
21. The cylindrical p coordinate is the projection of the position vector r onto the Cartesian x- y plane. The 
range of this coordinate is 0 < p < ©. The surface z = constant is a plane perpendicular to the z-axis. The 
surface @ = constant is a semi-infinite plane. The surface p = constant is a cylinder coaxial with the z-axis. As in 


Cartesian coordinates, the z-curve is a straight line. As in spherical coordinates, the @- curve is a full circle. The 


Figure 9.3. The point P of Figures 9.1 and 9.2 shown with its cylindrical coordinates p, @, z. 
p-curve is a curve along which the coordinate p varies while the other two coordinates ¢ and z are held 
constant. This curve is a straight line. The unit vectors €5, €g, and e, are tangentially along the p-curve, 
the @-curve, and the z- curve, respectively. The relationships between cylindrical coordinates (p, @, z) and the 


Cartesian coordinates (x,y,z) are 


x=pcos¢d,y=psing and z=z, Eq.(9.5a) 


and the inverse relationships are 


p= jx*+y?, ¢@ =arctan(y/x) and z = z. Eq. (9.5b) 


Using these relationships, 


dx =cos¢ dp — psing dd, 
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dy =sing@ dp +pcos®¢ dd, Eq. (9.6) 


dz = dz. 
Now, the separation between two neighboring points, given by 
(dl)? = (dx)? + (dy)* + (dz)’, 
can be rewritten on substituting for dx, dy, and dz in terms of dp, dd, and dz from Eq. (9.6), as 


(dl)? = (dp)? + (pdd)? + (dz)?. Eq.(9.7a) 


Thus, we note that the length elements dx, dy and dz of Cartesian coordinates are replaced, in cylindrical 
coordinates, by length elements dp, pd@¢ and dz. So, the volume element and the area element in cylindrical 


coordinates become 


dV = pdpdddz, Eq. (9.7b) 


and, 


dS = pdpd¢ e, + pdddz e, + dzdp ey, Eq.(9.7c), 


respectively. 
9.4 PLANE POLAR COORDINATES 


We note in passing that because the z coordinate in Cartesian coordinates and in cylindrical coordinates is the 
same, it is possible to treat the x- y plane as a p- @ plane. This relationship is a depiction of two-dimensional 
plane polar coordinates. In particular, an area element in this plane can be written as 

dS= dxdy or dS= pdp dd. Eq. (9.8) 


As an example of plane polar coordinates, we attend to an unfinished integral that we encountered in our 


discussion of Gaussian integrals in Chapter 2. We stated there, without proof, that [see remark after Eq. (2.15)] 


co 


p= | ex (-—ax?) ax=5 | 
P ~ 2Na" 


0 


Now we evaluate this integral. But, first we evaluate the related integral, 


co 


I = | exp-x?) dx. 


0 
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Then, the integral iu will be obtained from J by replacing x? by ax”. We write the J integral twice, first with 


dummy variable x and next with dummy variable y, and multiply the two integrals to get 


LP? =| | exp[—(x? + y”)] dx dy. 
0 oO 


Now, think of this integral as an integral over the first quadrant of the x- y plane. Converting to plane polar 


coordinates in the first quadrant, 


coo «6/2 © 
mu 1 1 
[e= | | exp(—p”) pdp do = 5 al exp(—u) du = ry 
p=0 p=0 0 
with u = p2. Thus, 
ra 
f= | exp(—x?) dx = ule Eq.(9.9) 


0 


Now, to get the Gaussian integral iS simply replace x? by ax? in Eq.(9.9). This gives 


co 


19 = | ex (—ax?) ax=5 | 
a P ~ 2Na’ 


0 


9.5 MULTIPLE INTEGRALS 


The curvilinear coordinates, discussed above, are very useful in carrying out integrals which involve areas or 
volumes of circles, spheres and cylinders. A few examples will impress the reader about the utility of curvilinear 


coordinates. 
Example: Show that the area, A, of a circle of radius R is, A = 7R?. 
Solution: 


The statement of this problem is so well-known that it is almost taken as a fact. However, historically speaking, it 
was considered quite an intellectual feat by Eudoxus in the fifth century BC to realize that the area of a circle is 

proportional to the square of its radius. At the time of Eudoxus, the formula for finding out the area of a triangle 
was well-known. So, the area of a circle was determined by inscribing in it regular polygons, which can be broken 


into several triangles. If A,, is the area of an inscribed polygon of n sides, then A,i-cie = im Ay. 
n-0co 
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Figure 9.4. A circle of radius R. Figure 9.5. Calculating area of a circle. 
Now, using multiple integrals, it is very easy to determine the area of a circle. Break the circle into large number 
of small pieces by drawing various concentric circles and radial spokes inside the circle, as in Figure 9.5. The area 


of the tiny area element shown in the figure is, 


dA = (dp)(pdd) = pdpd®. 
Then, 


R 21 
R2 
ee — | dA = | pdp | dp = > (2m) Ro Eq. (9.10) 
0 0 


circle 
Example: Determine volume, V, of a sphere of radius R. 


Solution: Since a small volume element in spherical coordinates, dV, is given in Eq. (9.4b), the volume of a whole 
sphere can be determined by summing (or, integrating) over all the small volume elements that make up the 


sphere. Explicitly, 


21 


R 1 
V= dV = | | r* sin @ dédrd@, 
sphere 


r=0 0=0 p=0 


or, 
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21 1 R R3 ae 
| do | sin 0 d@ | r*dr = (2m)(2) (5) = oa Eq. (9.11) 
=0 0=0 


r=0 


9.6 SOLIDS OF REVOLUTION 


Consider a small strip of length dx located in the x- y plane at (x, y) = (x, R) as in Figure 9.6. If this strip is 
revolved about the x-axis, it creates a disk of radius R and thickness dx, and, therefore, of volume V = 1R7dx. 
Generalizing this result, if a curve, representing a function f(x) with a < x < b, is revolved about x-axis, as in 


Figure 9.6, the volume of the solid generated will be 
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Figure 9.6. Solids of revolution. 
b 
Ve | rtr@oran. Eq. (9.12) 


a 


The solids obtained by revolving a curve about an axis are called the solids of revolution. 


Example: A straight line, given by the equation 


R 
i) =5%, 


is shown in the Figure 9.7. Revolution of this straight line about x-axis leads to a solid of revolution in the form 


of a cone of radius R and height H. The vertex of this cone is at the origin and its axis lies along the x-axis. The 


volume of this cone is 


R?H? 
v=fe le [ean eadX = Tg = gh Eq. (9.13) 
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Thus, in general, the volume of a cone of radius R and height H is 7R*H//3. 


Example: Consider a horizontal line of length H in the x — y plane at y = R. Revolution of this line around x- 


axis generates a solid in the form of a cylinder of radius R and height H. Its volume is 
H 
V= | m[R]2dx = 1R*H. 
0 


Thus, in general, the volume of a cylinder of radius R and height H is 7R7H. 


2 y 
(H,R) H 
R ————— 
>y > 
X x 
(0,0) 
Figure 9.7. A cone generated as a solid of revolution. Figure 9.8. A cylinder generated as a solid of revolution. 
9.7 CENTER OF MASS 


The concept of center of mass is very useful in the study of mechanics of a distribution of masses. The center of 
mass of the distribution, under the influence of external forces, moves as if all the mass of the distribution were 


concentrated there and as if all the external forces were applied there. 


Consider a mass distribution consisting of n discrete point-like masses, Am,, Am, -:- Am; -+: Am,, located at 
14,12 °° T;{°** Ty, respectively, as shown in Figure 9.9. The total mass M is the sum of all discrete masses. The 


center of mass of this distribution is located at 


_ Am)r, + (Am) r2+...+(Am;) rit... +(Amy) Tp 
ee (Am,) + (Am) +: + (Am,) 


n 
1 
ae yn Am, - Eq.(9.14a) 
i=1 


An extended object of total mass M can be imagined as a collection of a large number of point-like masses which 


are combined together to form a continuous distribution of mass. For such an object, the sum in Eq. (9.14a) can 
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be replaced by an integral. Thus, for a continuous distribution of mass, M, the center of mass of the body is 


located at 


1 
= r dm, Eq. (9.14b) 


all body 


Am; 


0 


Figure 9.9. The center of mass of a distribution of n discrete point-like masses. 


or, in Cartesian coordinates, 


1 
on = az] # dm, 


1 
Yon = aq | yam, 


1 
tom = qq | 7am. 


For a mass distribution with some symmetry, the center of mass is normally at the center of symmetry or on the 
axis of symmetry. For example, for a full circular disk of radius R, the center of mass is at the center of symmetry, 


namely, the center of the disk. 
Example: Determine the center of mass of a semicircular disk of radius R and mass MV. 


Solution: 


In this case, the mass per unit area is un = M/ (; mR?) = 2M /(mR7). Using plane polar coordinates in the plane 


of the disk: 


165 


1 


Rw R 
1 H 
xen = az | H(p089) dp p dg =4 | pap | cose do =0 
0 0 0 


0 


R R 
- yf in) dp pd =f otap{ Py ase, cle as 
Yom = 1 LL (psind) dp p ?=a p*dp | sing o=az Ge 6 on 
0 0 0 


0 


Thus, center of mass of a semicircular disk is located on its axis of symmetry at (0,4R/372). 


Figure 9.10. Center of mass of a uniform semicircular disk. 


9.8 MOMENT OF INERTIA ABOUT AN AXIS 


In elementary physics we learn that, during a translational motion, the kinetic energy of a body of mass m 
moving with a linear velocity v is K = =mv?. Consider a person riding a stationary bicycle for exercise. Since the 
bicycle is stationary, there is no translational kinetic energy associated with the exercise machine. All the work 
done by pedaling the bicycle is converted into rotational kinetic energy of the rotating wheel. If the wheel of 
the exercise bicycle is rotating with a constant angular velocity w, then its kinetic energy is K = ~1w*, where I is 
the moment of inertia or the rotational inertia of the wheel. In order to define the moment of inertia of a mass 
distribution, imagine several discrete point-like masses Am,, Am,,-:: Am; -+-Am,, rotating about a certain 
common axis of rotation, as shown in Figure 9.11. Furthermore, 1, Tz -*: 1% ++: %, respectively, are the shortest, 
perpendicular distances of various masses from the axis of rotation. Then, the moment of inertia of this mass 


distribution about the axis of rotation is defined as 


n 
l= »s ~ Om): Eq.(9.15a) 
i=1 
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An extended object can be imagined as a collection of a large number of point-like masses which are combined 


together to form a continuous distribution of mass. For such an object, the sum in Eq. (9.15a) can be replaced by 


an integral so that 


l= | r2 dm. Eq.(9.15b) 
all body 


axis of rotation 


Am, 
i Am, 
Am, 
Am, @ 
. Am, 


Figure 9.11. Moment of inertia of a distribution of n discrete point-like masses rotating about an axis. 


If the mass is distributed uniformly in a volume V with density D, (mass per unit volume), then 


T= [r Dy av. 
4 
If the mass is distributed uniformly in an area A with density D, (mass per unit area), then 
T= [ ro, dA. 


A 


If the mass is distributed uniformly along a line Z with density D, (mass per unit length), then 
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I= | r*D, aL. 
L 
Example: Determine the moment of inertia of a thin spherical shell of mass M and radius R rotating about any 


diameter. 


Solution: The shell has surface area of 47R?. If D, is mass per unit area, D, = M/(41R7), then mass of a small 
patch of area dA = (Rd@)(R sin 8 dd) is dm = D, dA. The perpendicular distance of this patch of area from the 
axis of rotation is7 = R sin @. So, 
T 20 
l= | [R sin 0]?dm = | | [R sin @]?D, R?sin@ dé dd 


shell 0=0 ¢=0 


or, with u = cos 6, 


T 21 


al 
1 =D, R* | sint9 ao | de =D, R* 2m [ (1 —w?) du 
0 0 —il 


4, 82/ M 2 
= 2n D, R* (;) = (=z) R* ==MR?. Eq. (9.16) 


3 \47R2 3 
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Figure 9.12. Moment of inertia of a thin spherical shell. 


Example: A thin rod of length L and mass M is rotating about an axis. The axis is passing through the center of 


the rod and is perpendicular to the rod. Determine the moment of inertia of this rotating rod. 


Solution: 


Axis of rotation 


\ 


\ A 


| 
ad 
a 
ow 
; 


-------3-- 
oe 
y 


Figure 9.13. Moment of inertia of a rod rotating about an axis. 
Since the mass is distributed along the length of the rod, we can define mass per unit length as D, = M/L. 
Consider a small mass element of length dx located a distance x away from the axis of rotation. The mass of this 


element is dm = D, dx. So, the moment of inertia is 


3. 
I= [ ean = x?D e=D. 
L L 3 


13 13 
=p; |— =| =|, 
Be ib ( =| 
2 


| 
No S NE 


rod 
or, 
es a = 1 ME Eq. (9.17 
Smo) ap : OMT) 
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PROBLEMS FOR CHAPTER 9 


1. Find the position of the center of mass of a solid cone of radius R and height H. 


2. Asolid sphere, of radius R, is cut into two identical hemispheres. Find the position of the center of mass of 


such a hemisphere. 


3. Find the moment of inertia of a solid cone of radius R, uniform mass density D, height H, and mass M about 


its axis of symmetry. Give the answer in terms of M and R. 


4. (a) Determine the moment of inertia of a thick-walled right circular cylindrical tube of mass M, inner radius a, 


outer radius b, and height h, about its central axis of symmetry. 


N 


(b) Using the result of part (a), determine the moment of inertia of a solid right circular cylinder of radius R 


and mass M. 


(c) Using the result of part (a), determine the moment of inertia of a right circular cylindrical shell, open at 


both ends, of radius R and mass M. The thickness of the shell can be considered as almost zero. 


5. (a) Determine the moment of inertia of a thick spherical shell, of mass M, about an axis passing through its 


center. The inner and outer radii of the shell are a and b, respectively. 
(b) Using the result of part (a), determine the moment of inertia of a solid sphere of radius R and mass M. 


(c) Using the result of part (a), determine the moment of inertia of a thin spherical shell of radius R and 


mass M. The thickness of the shell can be considered as almost zero. 


6. Biomedical Physics Application. The human heart is a muscular organ that is responsible for pumping blood 


throughout the body. Without resorting to an actual surgery, the size of a healthy heart can be estimated by 
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using a medical imaging technique called CAT scan or simply CT scan (for computed tomography). The scan 
produces equally spaced cross-sectional views of the heart. Suppose that a CT scan of a human heart shows 
cross sections spaced 1.0 cm apart. The heart is about 10 cm long and the cross-sectional areas, in square 
centimeters, are 0, 18, 38, 57, 72, 84, 71, 60, 39, 20, and 0. Using this information estimate the volume of the 


heart. 


7. Biomedical Physics Application. The velocity v of blood that flows in a blood vessel with radius R and length 


L at a distance r from the central axis is 
P 
= R2 Cand . 
Oa OM) 


where P is the pressure difference between the ends of the vessel and 1 is the viscosity of the blood. The flux 
(volume of blood flowing per unit time) F of the blood can be calculated by approximating the cross-sectional 


area of the blood vessel by concentric rings and integrating over all such rings. Show that the flux is given by 


_ mPR* 
8nL ° 


This is called Poiseuille’s Law. 
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Chapter 10: Vector Calculus 


In this chapter we will combine our previous knowledge of calculus and of vectors to learn about vector functions 
and their derivatives. We will also gain knowledge of a couple of specialized theorems, the Gauss’s theorem (or, 


divergence theorem) and Stoke’s theorem, which are relevant for vector functions. 
10.1 VECTOR FUNCTIONS 


A vector function is a vector whose components are functions of coordinates. As an example, a vector function in 


Cartesian coordinates is 


A(X, %2,%X3) = 1A; (X41, X2,%X3) + €2A2(%y, X2,X3) + €3A3(%1, Xp, X3). Eq. (10.1) 


Here e,, €2, and e3 are the three orthogonal unit vectors and A,, A,, and Az; are the three scalar functions of 


coordinates x,, Xz, and x3. 


Now, in vector calculus, just as in multivariate calculus, all derivatives are related to the rate of change of 
. : es af >. 
a function. For a scalar function f (x1, x2, x3), the three derivatives, ot (i = 1,2,3), represent the rate of 
i 


variation of the function f with respect to x; (i = 1,2,3). These three derivatives of f behave like the three 


components of a vector, which is called the gradient of f. The gradient of f is written as 


of of of 
Visa CE ee 3 Ox,” Eq. (10.2) 


Note that gradient of f is a vector function. The differential operator 


3 


—— 0 " 0 n 0 -) 0 
~ Stag, 2 Ox, 3 Ox, "i 9x, 


i=1 


is called the del or gradient operator. Let us now interpret the gradient of a scalar function. Consider the values 
of a scalar function f at two neighboring points in space. The first point is at r = (X,, X>,X3) and the value of the 
function there is f (x1, x2, x3). The neighboring point is atr + dr = (x, + dx, xz + dx2,x3 + dx3) and the value 
of the function there is f(x, + dx1, xz + dx2,x3 + dx3), as shown in Figure 10.1. The change in the value of the 


function, f, on moving from one point to a neighboring point is 
df = f(x, + dx,, x2 + dx2,x3 + dx3) — f(X1,X2,X3), 


or, keeping only the first order terms in the Taylor-series expansion of the function, 
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(x, + dx,, xX, + dx x,+dx,), f+df 


(X},Xy%y), F(r) 


Figure 10.1. Values of a function at two neighboring points. 


of of of 
df = Fee + wee? + ae = Vfe dr. Eq. (10.3) 


This expression leads to the following properties of the gradient of a scalar function f. 


First, the change, df, in the value of the function is maximum when Vf is parallel to dr. Thus Vf is a vector that 
represents the magnitude and direction of the greatest rate of change of f. Second, if dr is tangential to the 


surface f(r) = constant, then both r and r + dr lie on the surface. So, 
df = f(r + dr) — f(r) = 0, 
or, 
Vf edr=0. 


Since dr is tangential to the surface, it means that Vf is normal (or, perpendicular) to the surface f(r) = 
constant. We note in passing that in electrostatics the electric field, E = —V@, is normal to an equipotential 
surface on which the electrostatic potential, @, is constant. This fact is a direct consequence of the second 
property of the gradient of a function. The rate of variation of the scalar function f along any arbitrary 


direction n is given by neVf, which is called the directional derivative. 
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Divergence and Curl of a Vector Function 


Because of the vector nature of the del operator, V, we can form the following two combinations for a vector 
function A: 
0A, OA, OA; 


0 
= e(e,A, ata eA, ata e3A3) = ax, ata Oxy ar Ox, Eq. (10.4) 


0 0 
vea=(er5—ter5—t e3 


which is called the divergence of function A, and 


0 0 
vxa=(e,——-+e,-—+e,-—] X (e; A, + e2A, + €343) 
al rz: 3 


i = =) eS =) ( =) 
=e 2 3 


which is called the curl of function A. If, for a vector function A, VeA = 0, then Ais called a solenoidal vector 
function. On the other hand, if V x A = 0, then A is called an irrotational or conservative vector function. Finally, 
we define VeVf, which is the divergence of the gradient of a scalar function, as the Laplacian of f. It is written as 


V’ f. Explicitly, 


2 2 2° 
Ox, OxZ Ox5 


0 0 0 of of of a’f a*f arf 
2 es — 
es (e: Ox, Tea Ox es x) : (e: Ox, ey Ox vs x) 


Eq. (10.6) 


The Laplacian operator, V2, can also operate on a vector function A(x, x2,X3) as, 


V2A = e,V2A, + €2V2A, + €3V2A3. Eq.(10.7) 


Note carefully that since the Cartesian unit vectors e;, e2,and e3 are constant in magnitude and direction, they 


can be taken out of the V2 operator. Finally, we prove an important identity, 


Vx(V XA) = V(Ve A) -V2A. Eq. (10.8) 


To prove it, we start with the left-hand side of the identity, 
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i) 
Left Hand Side = > €ijx & Pm X A}; 
J 


ik 
0 
=) fit ej at campo an] = Yyeaz ei Ox; <A m >, €1tj€eim 
ijk im us ' 
aa 09 
= DY ee FA (Budi — bind) = ee alee 
ij lm i : 


=D “inom Ox; pad = V(VeA) — V2A. 


i 


Here we used the epsilon-delta identity of Appendix C. The above vector identity is valid only for Cartesian 


components of the vector function A(x,, Xz, X3) since the unit vectors e,, €2,@3 are taken to be constant in 


magnitude and direction. 


In the remainder of this chapter, including the following examples, it will be easier to use (x, y, Zz) notation 


instead of (x1, X%2,%3) notation. However, for the Cartesian unit vectors we will continue to use e,, €2,and e3. 
Example: Consider the scalar function, 

f(xy,z) = +y?—-2* +xy—-—xz- yz. 
Determine Vf, unit vector along Vf, directional derivative along e, + e,, and V7f at (1, 1,1). 


Solution: Starting with gradient of f, 
of of of 
Vio El og 2 ay +e ©3435 
=e,(2x +y—z) +e, (2y+x—z) +e3(—2z—-—x-y). 
Vf at (1,1,1) = e,(2) + e2 (2) + e3(—4). 


1 1 
Unit vector along Vf at (1,1,1) = —=(2e, + 2e, — 4e3) = ~(e, + eg — 2€3). 
V24 v6 


In order to determine the directional derivative along e, + e2, we first determine the unit vector along e, + ez, 


which we will label as 7. It is 
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1 
n= aes +e). 


So, the directional derivative is 


neVf = zee +e,)e[e,(2x +y —z) +e,(2y +x —z) +e3(-2z-—x-y)] 


adnate ay Apes en ee a ea aay 
x+y—-—z+2y+x-2z]) = ————_ = — at(1,1,1). 
V2 V2 


af 
~ 2 


Finally, 


O° f OFF O7f 
2f — . + _ = 
Wie Ox? dy? az? eee a 


Example: Given the vector function 


A=x*ye, +y’ze,+2z7xe3, 


determine its divergence, curl, and Laplacian. Then, explicitly verify that 


Vx (V XA) = V(VeA) — V2A. 


Solution: The divergence of A is 


OA, OA, OA 
~+— F 2 = 2xy + 2yz + 22x. 


VeA= 
° Ox oy 


The curl of A is 


ey e€2 e€3 
x7) = —-y’e, — ze, 


0 0 0 
VxA= Ox dy a = e,(0 —y”) + e,(0 —z”) + e3(0 _ 
x*y y*z zx 


The Laplacian of A is 
VA = e,(2y) + e,(2z) + e3(2x). 


Now we verify the identity, 
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4 €2 e€3 


0 0 
Vx (V x A) = ax ay az = e,(2z) + e, (2x) + e3(2y) — 2[ze, + XE2 + ye3| 
—y? —72 —x2 


V(VeA) = e,(2y + 2z) +e, (2x + 22) + e3(2y + 2x). 
On substituting explicitly, 
V(VeA) — V7A = 2ze, + 2xe, + 2ye; = VX (V XA). 


Example: Find V f(r) wherer = xe, + ye, + ze; with r? = x? + y” + z”. Note f(r) is a function of only 


the magnitude r of the vector r. 


Solution: Starting with gradient of f(r), 


0 0 0 
VIM =e1s-f&) +e2 ayi + e3 a f(r) 


ag TO OO 
~ “lax dr 2ay dr 3az dr 


Use 


to obtain 


x@;+tye,+zez,df_ rdf 


i) a a 


10.2 LINE INTEGRALS 


The line integral of a vector function A over an arbitrary path joining point 1 to point 2 is written as, 


2 
i= Aedr. Eq.(10.9) 
ak 


It is evaluated by breaking the path, shown in Figure 10.2, into a large number of small segments of lengths 
dr,, dr,, -:- dr; --- dr,,. For each tiny segment, we calculate the value of the integrand, namely, A;edr; and add all 
the values. The limiting value of this sum as the number of segments becomes very large or as the segment sizes 


become very small is the value of the line integral. In other words, 
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2 n 
i= | Aedr = lim > avedr, 


From this definition of the line integrals, we can derive an important result. Recalling from Eq. (10.3) that for any 


scalar function 


df =Vfedr, 


Figure 10.2. The path for a line integral. 


we obtain 
f(a) — fQ) = (Vf).° dry, 
f(b) -f@ = (Vf)2° dr, 


f(c) —-f®) = (Wf)3¢ drs, 


f(2)-f@ =(Wf)n dr. 


Simply adding all these lines, we get 


f2)- FO) = Vwpiedn > | vfear, 


t=1 


independent of the path joining points 1 and 2. In particular, for a closed path, 
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p(Uf)ear = 0} Eq.(10.10) 


for any scalar function f (x, y, z). The symbol of a circle on an integral sign indicates that the path of integration 


is a closed path. 


10.3 SURFACE INTEGRALS 


Figure 10.3a. Comparison of an open surface versus a closed surface. 
In surface integrals, we encounter two different kinds of surfaces, open surfaces versus closed surfaces, as shown 
in Figure 10.3a. An open surface has two sides with a boundary C but no enclosed volume. An example of such a 
surface is a flat or a crumpled sheet of paper. In order to go from one side of an open surface to its other side, 
one has to cross the boundary C. A closed surface, on the other hand, has an enclosed volume but no boundary. 
Examples of closed surfaces include the surface of the Earth, the brown surface of an Idaho potato, etc. The 


surface integral of a vector function A over an arbitrary surface, open or closed, is written as 


J= [ Aras = [Aen dS. Eq. (10.11) 
Ss S 


Here nis the normal to the area element dS. The surface integral is evaluated by breaking the whole surface 
area, shown in Figure 10.3b, into a large number of small pieces of areas AS,, AS, --- AS; --- AS,,. For each small 
piece, we calculate the value of the integrand, namely, A;*AS; and add all the values. The limiting value of this 


sum as the number of pieces becomes very large or as the size of all pieces becomes very small is the value of 


the surface integral. In other words, 
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n 
5 n> £ 
t=1 


In the case of a closed surface, this integral is called the flux of vector function A out of a closed volume V. 


Figure 10.3b. Breaking a surface area into a large number of small area elements. 


10.4 VOLUME INTEGRALS 


The volume integral of a scalar function f(x, y, z) over an arbitrary volume V is written as, 


k= | fey.) dV. Eq. (10.12) 


It is evaluated by breaking the volume, shown in Figure 10.4, into a large number of small elements of volume 
AV,, AV, +++ AV; «+» AV,,. For each little element, we calculate the value of the integrand, namely, f (x;, y;, Z;)AV; 


and add all the values. The limiting value of this sum as the number of volume elements becomes very large 


Figure 10.4. Breaking a volume into many small volume elements. 


or as the element size becomes very small is the value of the volume integral. In other words, 
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n 
K= | fe,y.2) av = lim YP" fev y. 208M 


10.5 FLUX OF A VECTOR FUNCTION: GAUSS’S (OR DIVERGENCE) THEOREM 


The flux of a vector function A out of a closed volume V has a very interesting property. If we break the volume V 
into two smaller pieces, then the sum of fluxes of A out of two closed smaller volumes is equal to the flux of A 


out of the original large, closed volume V. Let us prove it. 


Figure 10.5. Breaking a volume V into two smaller volumes V, and V2. 


Consider a closed surface S enclosing a volume V, as in Figure 10.5. For clear visualization, imagine this surface to 
be the brown surface of an Idaho potato. In order to determine the total flux of a vector function A out of 
volume V, we first consider the flux of A through a small surface element dS. It is Ae n dS = A edS. Total flux of 
A out of volume V is S,Aeds. Now break the volume V into two pieces by cutting the potato with a knife. The 
two cut pieces have volumes V, and V, and the corresponding brown surface areas are S, and S,, respectively. 
Clearly, V =V, + V, and S = S, + S;,. The cutting of the potato also exposes the interior white surface area, S,,, 
that is common to both pieces of the potato. The volume V, is enclosed by surface S; = S, + Sq, and the volume 


V, is enclosed by surface S, = S, + Sqy. Total flux of A out of volume V;, is 


aends + | Aen, ds, 


Sa Sab 


where n, is the outward normal on the Sqy part of S,. Total flux of A out of volume V, is 


[ asnas+ | Aen, dS, 


Sb Sab 


where n, is the outward normal on the S,, part of S,. Now, the sum of total flux of A out of volume V,; and the 


total flux of A out of volume V, is 


[ acnas+[ Aen ds+ | aends+ | Aen, dS. 


Sa Sab Sb Sab 
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Noting that n, = —n,, the sum of fluxes out of volume V, and out of volume V, becomes, 


[ acnas+| aenas = | Aen ds = [aends, 
Sa Sb SatSp s 

which is equal to the total flux of A out of volume V. We can generalize this result by subdividing a giant volume 

V into a large number of smaller component volumes, of any size and shape. Then the sum of fluxes of A out of 

smaller component volumes is equal to the flux of A out of giant volume V. For convenience, we take each small 


component volume to be a parallelepiped in shape. 
Flux of A Out of a Parallelepiped 


Now, we will determine the flux of a vector function A out of a parallelepiped of side lengths (Ax, Ay, Az). Total 
volume of the parallelepiped is AV = AxAyAz and its surface area AS consists of six faces; namely, left face, right 
face, front face, back face, top face, and bottom face, as shown in Figure 10.6. Total flux of A out of the 
parallelepiped, dig Ae dS, is the sum of six terms each representing a flux out of a different face. We will 


determine the flux out of three pairs of faces, the left-right pair, the front-back pair and the top-bottom pair. 


Figure 10.6. Flux of vector function A out of a parallelepiped. 


Flux out of the left face 1= f,, AendS = — f,, A, dx dz = —A,(1) AxAz 


Flux out of the right face 2= f,, Aen dS = f,, A, dx dz = +A,(2) AxAz 


Total flux out of the left-right pair of faces = [Ay (2) — Ay (1) ]Axdz = (=) ay] AxAz = “> AV. 
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Similarly, total flux out of the front-back pair of faces = (~) Ax| AyAz = “AV and total flux out of the top- 


. OAz _ OAz 
bottom pair of faces = (*) Az| AxAy = 3; AV. 
Adding the contributions from all six faces, total flux of A out of volume AV is 


i Aends = (Ve A)AV. 
AS 


Now, any arbitrary volume V can be broken into a large number of small parallelepipeds as shown in Figure 10.7, 


so that, in general, for a closed surface S, 


Figure 10.7. Breaking an arbitrary volume into a large number of parallelepipeds. 
I Aen dS = (VeA)dV. Eq. (10.13) 
Ss enclosed V 
This is known as the Gauss’s theorem or the divergence theorem for vector functions. 
Example: Consider a vector function: 


A=xye, + yZe2 + 2x e3, 


and a parallelepiped with side lengths a, b,c and volume V = abc. Show that the divergence theorem is 


satisfied. 
Solution: The divergence of the vector function is 
VeA=yt+z+x. 


The integral of this divergence over the volume of the parallelepiped of Figure 10.8 is 


[ vena =| (+y+2) dedyd, 
v v 
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a b c a b c a b c 
[ veaav =[ xdx [ ay [ az+[ x{ ydy | az+[ ax { ay { 2dz 
4 0 0 0 0 0 0 0 0 0 


az 2 c2 


V 
= zbetazctab = z(atbtc). 


V=abe 


====- N 


Figure 10.8. A parallelepiped with side lengths a, b, and c. 
The surface integral of the divergence theorem over the six faces (left-right pair, front-back pair and the top- 


bottom pair) of the parallelepiped is 


[ aenas = | aends + { 
Ss front 


aends + { AendS 
back 


left 


+| aends + { aends + { Aends 
right top 


bottom 


Ae(-evdyde + | Ae(—e,)dxdz. 


left 


=, As(e:)dydz + | 
front 


back 


+| Ae(e,) dxdz + i Ae(e3) dxdy + Ae(—e3) dxdy 
right top 


bottom 


b rc 
=i, i ay dydz+f f (—0)dydz+f f (—0)dzdy 


a rc a rb 
+| | bzdxdz + { | cx dxdy +f f (-0)dxdy 
0 0 0 0 


b? c* V 


=azetab + 7be=sZ(atbtc). 
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Thus, divergence theorem: 


i Aen ds = | (Ve A)dV, 
six faces 


parallelepped 
is satisfied. 
10.6 CIRCULATION OF A VECTOR FUNCTION: STOKE’S THEOREM 


Consider an open surface S whose boundary is the closed loop L as in Figure 10.9. We define the circulation of 


a vector function A in closed loop L as the line integral 


f Acdr. Eq. (10.14) 


Figure 10.9. Circulation of a vector function A in loop L. 
Here dr is everywhere tangential to the loop L. In general, the circulation of a vector function in a loop L has a 
very interesting property. Imagine that the area enclosed by loop L is broken into two smaller areas, with each 
area enclosed by a smaller loop. Then, the sum of circulations of a vector function A in two smaller loops 


(calculated in the same sense as in the original loop) is equal to the circulation of A in the large original loop L. 


L, - 77 ~hLe 
— L S\ 
° 4 |) | 
em jo 
41 | | \ / 
/ email L Ir, aaa / 
( be / 


Figure 10.10. Breaking a loop L into two smaller pieces L, and Ly. 
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In order to have a clear visualization, imagine that the large loop is made up of a blue color line as in Figure 
10.10. This loop encloses an area S. Now, using a pair of scissors, we cut the area along the line L,,, to get two 
smaller areas S, and Sz. Also, during this cutting, the blue line is cut into two pieces L, and L,. Clearly, S = S, + 
Sz and L = La + Ly. Area S, is enclosed by loop L, = Lg + Lap and area Sy is enclosed by loop Lz = Ly + Lay. 


Now, circulation of Ain loop L, is 


| acdr+ | Aedr,, 
GE 


a Lab 


where dr, is along the line of cut L,,. The circulation of Ain loop L, is 


| 


where dr, is along the line of cut L,,,. Thus, the sum of the circulations of Ain loop L, and in loop L, becomes 


[ avars | 


a Lab 


acdr+{ Aedr,, 


b Lab 


acdr+ | Ae dry. 


Lab 


acdr, + | 


Lp 


If the sense of circulations in the two smaller loops is the same, as shown in Figure 9.10, then dr, = —dr,. So, 


the sum of circulations in loops L, and L, becomes 


[ acar+ | acar= | Acdr= | Acdr, 
L Lp LatLp L 


a 


which is equal to the circulation of Ain loop L. This result holds even when S, and S, are not coplanar. 


Figure 10.11. Planar versus nonplanar open surfaces. 
Generalization is achieved by noting that loop L can enclose an infinite number of open surfaces. These surfaces 
need not be planar in shape and can be warped like a crumpled sheet of paper, as in Figure 10.11. Any one of 
these surfaces can be broken into a large number of smaller surface areas of any size and shape. Then the sum of 
circulations in loops surrounding these smaller surface areas is equal to the circulation in original loop L. For 


convenience, we take each of the smaller surface areas as a flat rectangle. 
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Circulation of A in the Loop Surrounding a Flat Rectangle 


Now, we will determine the circulation of a vector function A in the loop surrounding a flat rectangle, in the x- y 
plane, of side lengths Ax and Ay, as shown in Figure 10.12. The area of the loop is dS = AxAy e3. the loop 


consists of four sides of the rectangle labeled 1, 2, 3, and 4. So, the circulation of A has four contributions, 


Aedr = [ Acax + [Avay = [Acar = [ vay, 
1 2 3 4 


Rectangle 


IR 


A,(1)Ax + Ay(2)Ay — A,(3)Ax — A, (4)Ax, 


—[A,(3) — A, (1) JAx + [Ay (2) — Ay (4) Ay, 


On=e; 


Figure 10.12. A rectangular loop in the x- y plane. 


or 


aedr = —| 


Rectangle 


OA, dA, 
ay ay| Ax + a ax| Ay =(V XA), AxAy = (VX A)ends. 


Now, going back to any general open surface, with loop L as its boundary, that is broken into a large number of 


smaller surface areas in the form of flat rectangles: 


f Aedr = | (V x A) ends. Eq.(10.15) 
1 Any open surface bounded by L 


This is the statement of the Stoke’s theorem. 
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Example: Consider a vector function A = xy e; + yz ez + Zx e; and a square of side a in the y- z plane. Show 


that Stoke’s theorem is satisfied by the vector function A and the square loop. 


Solution: 


For this vector function, the curl is 


€, €2 6&3 
(ens) 0 
ax dy dz 
xy yZ 2x 


VxA= 


_ a a 7) 7) 7) ) 
=e, ex) -509| +e [5 oy) —5,0|+e3 [5-02 — 5 oo] 


or 


VxA= —eyy- C2Z — €3Xx. 


On=e, 


Figure 10.13. A square loop in the y- z plane. 


Since the square lies in the y- z plane and we will traverse it in the counterclockwise sense, n = e,. So, 


a a 
[ovxayen as = | (-e y — @22 — @3x) © e, dy dz 
S 0 YO 


a F A az a3 


Now, four sides of the square are labeled with numbers 1, 2, 3, and 4. Circulation of A in the square is 
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asdr=| Ixy dx + yz dy + 2x dz] =0+ [ yzdy+t0, 
Square 14+2+3+4 1 


+2+3+4 


since x = 0 everywhere on the square in the y- z plane. So, circulation becomes 


[,.Brar=[@=ordy+ [yz @=0+[ @=asy+ | ye (@y=0) 


3 


° a 
=0+0+a{ ydy +0=-—. 
‘ 2 
Thus, Stoke’s theorem, 


(V x A) ends. 


Aedr = I 
Square loop L Square surface bounded by L 


is satisfied. 
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PROBLEMS FOR CHAPTER 10 


1. Determine gradient (V) and Laplacian (V7) of rand 1/r wherer = jx? +y?2+4 z?. 


2. Determine the divergence and curl of the vector function F = = e, + > ey + = e,. 


3. lfr=xe, +ye,+ ze, is the position vector andr = /x* + y? + z? is its magnitude, then determine, 


(a) Ve(rr), 
(b) Vx(rr) 
(c) V2(r?). 


4. Consider a vector function A = In(xyz) e, + In(yz) ey + In(z) e,. For this vector determine 
(a)V ° A, 
(b) V XA, 
(c) V2A, 
(d)V(V * A), 
(e) V x (V XA). 
(f) Using these results, verify V x (V x A)=V(VeA)—V2A. 


5. Consider a vector function A= (x? — y*)e,+xyze, — (x + y + z)e, and a cube bounded by the 


y 
planesx = 0.x = 1,y = 0,y = 1,z2 = Oandz = 1. 
(a) Determine the volume integral iV e AdV, where V is the volume of the cube. 


(b) Next determine the surface integral S,A endsS, where S is the surface area of the cube. Compare with 


the result of part (a) 


6. Consider a vector function A= (2x — y) e, + yz? ey + y*z e,. Also, S is the flat surface area of a rectangle 
bounded by the lines x = +1andy = +2 and C is its (rectangular) boundary in the x — y plane. 
(a) Determine the line integral fA ‘dr 
C 
(b) Next determine the surface integral SV x A) en dS. Compare with the result of part (a). 


7. Biomedical Physics Application Entomologists have known that fruit flies are attracted to sugary substances 


and feed on overripe fruit as well as on spilled sodas. The fruit flies approach the fruit along the direction in 


190 


which the smell of the decaying fruit is strongest. The concentration of the smell of the rotten fruit at a point 


(x, y, Z), due to some overripe fruit at the origin (0, 0, 0), is, 
C(x,y,Z) = Co exp(—2 x? —y? + 27). 


Here x, y and z are in meters. If a fruit fly detects the presence of rotten food while it is at the point 


(0.6 m, 0.8 m, 0.9 m), determine the unit vector along the initial direction in which the fruit fly approaches the 


food. 
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Chapter 11: First Order Differential Equations 


In our previous discussion of calculus, we were given a known function F(x) and were asked to find either the 
derivative or the integral of this function. In some other situations, we are given the first, second or higher order 
derivative of an unknown function and are asked to figure out the function. That brings us to the domain of 
differential equations. Our aim in this chapter is to provide merely a flavor of differential equations, so we will 
confine our attention only to first order ordinary differential equations. The prolific topics related to differential 


equations, ordinary as well as partial, are covered in books which are entirely devoted to this subject. 
11.1 A FIRST ORDER DIFFERENTIAL EQUATION 


Consider the case where an unknown function y(x), which is yet to be determined, satisfies, 


dy _p a dy _ iy Eq.(11.1 
aus (x) or ag (y) or ago (x,y), qaC Ei) 


where the functional form of F(x) or G(y) or H(x, y) is known. Such equations are called differential equations. 
The order of a differential equation is the order of highest derivative appearing in the equation. An ordinary 
differential equation involves derivatives of a function of a single independent variable. All equations shown in 
Eq. (11.1) are first order ordinary differential equations. A partial differential equation, on the other hand, 
involves derivative of a function of multiple variables. The wave equation, that we encountered in Chapter 1, 


serves as an example of a partial differential equation since it contains the following derivatives, 


arf 10F 
ax? v2 At?” 


The wave equation is a second order equation. When a differential equation for the function y(x) is integrated, a 
constant of integration, C, will appear naturally as a part of integration. This constant will need to be evaluated 
using the constraints on the function y at some fixed value of variable x. Such constraining relationships are 
called the boundary conditions. If the variable in the differential equation is time, t, the boundary conditions are 


referred to as the initial conditions. 


Several first order differential equations can be solved by direct integration. The following examples will illustrate 


this point. 


Example: Solve the differential equation 


|x| <1, Eq. (11.2) 
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for the function y(x) with the boundary condition y(1) = 0. 


Solution: On integrating both sides of the differential equation (11.2) with respect to variable x, we get 


xdx LE 
y= | +. 
V1 — x? 
To do the integral, substitute 1 — x? = u?, or x dx = —u du, 


xdx udu 
——— = — | — = -u = -V 1 - x?. 


1—x2 u 
So, y = —V1 — x? + C is the solution of the differential equation and the boundary condition gives C = 0. 


Example: Solve the differential equation for y(x), 


dy 
= fy, Eq.(11.3 
ax ey. q- (1h) 


with boundary condition y(0) = 0. 


Solution: Since we have to solve for function y in terms of variable x, we separate out the x and y dependent 


parts in Eq. (11.3) and then integrate to get 


dy 
—._=/d C. 
[i | x + 


arctany=x+C. 
Or y = tan(x + C) is the solution of the differential equation. The boundary condition implies C = 0. 


d , : 
~ can be written in a separable form as 
x 


In some problems ae 


as h 
a g(x)h(y). 


Then, 


[S- [ seoax+ C. 
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Example: Solve the following differential equation for y(x), 


dy 9expx 
Ss Eq.(11.4 
de gz q. (11.4) 
with y(0) = 3. 


Solution: Write Eq. (11.4) in separable form and then integrate to get 


[o°dy = | 9expxax+ C. 


3 
7 = 9expx+C. 


On substituting boundary condition, C = 0. Thus, the function y(x) is, 
x 
y =3exp (5). 
11.2 INTEGRATING FACTOR 


Some first order differential equations are of the form 


d 
a + by = F(x), Eq.(11.5a) 


where a and b are constants. These equations are solved by introducing an integrating factor. The integrating 


factor for equations of this type is 


b 
exp (- x). 


First, we rewrite Eq. (11.5a) after dividing by constant a, 


eal Gt BE; Eq. (11.5b 
We ae q. (11.5) 


On multiplying both sides of Eq. (11.5b) by the integrating factor, the left-hand side becomes an exact 


differential, 
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apy ons) 
a ae a) ee 


Integrate both sides of this equation with respect to x to obtain the solution 


b 1 b 
—x)=- —x) F(x)d. G, Eq. (11.6 
y exp (=x) = | exp(22) (x)dx + q- ( ) 
where C, the constant of integration, will be determined by the boundary conditions. 


Example: An R- L circuit. The circuit consisting of an inductor of inductance L and a resistor of resistance R 
is connected to a source of alternating current of frequency w and voltage Vy cos(wt). Using Kirchhoff’s rule 


in physics, the current I(€) in the circuit satisfies the first order differential equation 
dl 
L Ae + RI = Vy cos(wt). Eq. (11.7a) 


Determine the current I(t) as a function of time t. 


Solution: In this case the differential Eq. (11.7a) is of the same form as Eq. (11.5a). So, the integrating factor is 
exp ¢ t). On multiplying the differential Eq. (11.7a) by the integrating factor and dividing by L, it becomes 

: (Fe) 1c - vo (;«) t 

= [exp L ( | = 7 exPlT cos(wt). 
On integrating both sides of this equation with respect to t using the standard integral, 


exp(at) 


(ae [a cos(bt) + b sin(bt)], 


| expcat) cos(bt) dt = 


we obtain 


R Vo . 
exp (; t) I(t) = cine aA cos(wt) +w sin(wt)| + C. 


(i) 


The constant of integration C is determined from the initial condition. If, for example, at t = 0,7 = 0, then 


exp(T) jr 
a E 


so that, finally, 
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Rt 
Vi il R ; V)R €Xp (- —) 
—_—_— - cos(wt) +w sin(wt)| = SS Eq. (11.7b) 


"@ +0 PETRY +09] 


Example: Molecular Isomerization. Some molecules with the same number of identical atoms can exist in 
several stable configurations or arrangements of their component atoms. Such configurations are called 
different isomers of the same molecule. For example, the molecule of propane-based alcohol, commonly 
called propanol, has two common isomers. The first isomer, 1-propanol, a primary alcohol, and the second 


isomer, 2-propanol, a secondary alcohol, have structural formulas as shown in the Figure 11.1. 


H 
H H H H 6 igi 
H-C-C-C-OH H-C-C-C-H 
HHH HHH 
1-Propanol 2-Propanol 


Figure 11.1. Two isomers of propanol. 


The propanol molecule is of significant importance in biological physics. Determine the ratio of populations of 


two isomers of the propanol molecule in equilibrium. 


Solution: A typical molecule isomerizes (or flips) between two configurations M, and M,. The probability that the 
molecule in configuration M, flips into M, in time At is proportional to At—the longer the time period At, the 
higher the probability. Thus, the flipping probability is k, At. The constant of proportionality k,, which 
represents the probability per unit time, is called the reaction rate. Stated differently, the rate of change of a 
single molecule from configuration M, to configuration M, is k.,.. lf there are N, molecules in configuration M, 
then rate of change of all molecules from M, to M, is k,.N2. Similarly, probability that the molecule in 
configuration M, flips into M, in time At is k_At, where k_ is the reaction rate for the reverse isomerization. The 


complete isomerization process can be expressed as 


ky 
M,=M, 
k 


Note that in this isomerization process a single molecule of first configuration is converted into a single molecule 


of second configuration. Thus, the total number of molecules, No, = N, + Nz, stays the same at all times. 
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Assume that in the beginning, at t = 0, N, = N;,; and N, = 0, which is the initial condition. The rates of change 


of N,, the number of molecules with configuration M,, and of N, the number of molecules with configuration 


M,, are 
dN, 
“ae. = —k,N, + k_N, = —(k, + k_)N> + k_ Neots 
aN, aN, 
ae = +k,N, _ k_N, => —(k, + k_)N, + ky Not => Toe 
Thus, the first-order differential equation satisfied by N,(t) is 
dN, 
Sane + (ky +k_)No = k_ Neot, Eq.(11.8a) 


which is of the same form as Eq. (11.5b). The integrating factor of this equation is exp[(k, + k_)t]. Therefore, 
d 
ae exp[(k, + k_)t]} = k_Nyor exp[(ky + k_)t], 


which on integration gives 


a HONEESE SG exp[—(k, + k_)t]. Eq.(11.8b) 
(ky +k) 7? eae 


Here C; is the constant of integration. Similarly, 


Ki Neot 


N,(t) = (k, +k.) +k) 


+ C, exp[—(k, + k_)t], Eq.(11.8c) 


where C, is the constant of integration. Using the initial condition, we get 


C,=-G) = c eeieae 
(ky + k_) 
Thus 
_ k_Neot Ki Neot 
N2(t) = Ci) +h) (k, + k_)t] 
and 
kN, 
N,(t) = p> [1 — exp[-(k, + kel]. 


(ky, + k_) 
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Note N,(t) + N2(t) = No; at all times. Also, N2(0) = Not, N,(O) = 0. Finally, after a sufficiently long time 


[t > co], when equilibrium is attained, 


k_N, kN, 
N,(00) = a N,(00) = ate. 
(ky +k_) (ky +k_) 
Thus, at equilibrium 
+ _ aN, dN, 
Ny eq = a tot» Noea= E+ Re Neot and ae 0 as well as ae 0. 
Also, note that at equilibrium, 
Ni eq = ky 
No eq k_’ 


that is, the ratio of population of two isomers is the same as the ratio of two rates. 


Example: Newton’s Law of Cooling. It is a common experience that if a glass of water at room temperature 


is placed outdoors on a hot day in summer, the temperature of water starts rising quickly. Similarly, if the 


same glass of water at room temperature is placed outdoors during a cold wintry night when the temperature 


has plunged due to the polar vortex, the temperature of water in the glass starts falling rapidly. Newton 
formulated the Law of Cooling by stating that the rate of change of temperature of a body, at any time f, is 
proportional to the difference between the constant temperature of the body’s surroundings, T,, and the 
temperature of the body, T(t), at time t. Mathematically, 

a, C[T, —T(®)] 

SG = CIs - TO). 
where C (C > 0) is the constant of proportionality. This is a first-order differential equation whose solution 
provides the temperature of a body, versus the constant temperature of its surroundings, as a function of 


time. Determine an explicit expression for the temperature of the body, 7(t), at any time t. 
Solution: The differential equation to be solved is 


dT(t 
_ CTO Ci Eq.(11.9a) 


If we define a new function, u(t), as u(t) = T(t) — T,, then 


du(t) _ du(t) _ 
Atace —C u(t) or rae —C dt. Eq. (11.9b) 
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A straightforward integration gives 

Inu(t) =—Ct+t+k, 
where k is the constant of integration. This equation can be rewritten as 

u(t) = T(t) —T, = exp(-C t + k), 
and can be rearranged as 
T(t) =T, + exp(—C t +k). 

The constant of integration, k, can be obtained by setting t = 0, to get 

exp(k) = T(0) —T;, 
so that, finally, 

T(t) =T, + [T() —T,] exp(-C t). Eq.(11.9c) 


Note that at a much later time, that is, £ > co, T(oo) = T, and the temperature of the body becomes equal to 


the temperature of its surroundings, as expected. 


11.3 APPLICATIONS OF NUCLEAR PHYSICS 


Without any doubt, nuclear physics has made tremendous contributions to modern medicine. Radiology and 
nuclear medicine involve the use of radioactive nuclei to diagnose, evaluate, and treat various diseases. The 
nucleus of any atom contains protons and neutrons. A single nuclear species, having unique values of the 
number of protons and number of neutrons, is called a nuclide. About 80% of the nuclides occurring on the Earth 
are stable nuclides; that is, they do not disintegrate, or decay, into other nuclides. On the other hand, about 20% 
of the nuclides are unstable, or radioactive, nuclides which can decay into other stable or unstable nuclides. In a 
sample of radioactive material, the number of radioactive nuclei continues to decrease with time as some of 
them disintegrate into other nuclides. In a radioactive transformation, the disintegrating nuclides are called 
parent nuclides and they decay into so-called daughter nuclides. Generally, the rate at which the parent 
nuclides decay varies from nuclide to nuclide. If N(t) is the number of radioactive nuclides in a sample, then the 
number dN of nuclides that will undergo disintegration in time dt will be proportional both to dt as well as to 
N(t). Saying this differently, the longer the time period dt, the more decays occur, and the larger the number N 


of nuclides available to decay, the more decays occur. Thus, 


aN =—AN at, 
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where the negative sign signifies a decrease in the number N. The constant of proportionality A is called the 
decay constant and it is different for different nuclides. Therefore, 

dN 

—+AN=0. Eq. (11.10) 

dt 
Example: Radioactivity. In a sample of radioactive material, the number of nuclides at t = 0 is N(0). 
Determine N(¢), the number of radioactive nuclides later at time t. Also, determine the time at which the 
number of radioactive nuclides has decreased to half of the number at t = 0. This time is called the half-life, 


T 1/2, of the radioactive sample. 


Solution: The first order differential equation satisfied by N(t) is 


dN =-aN 

dt : 
which can be rewritten as 

al =—-Adt 

N 


A simple integration of this equation gives 
InN(t)=—-At+C. 


Here C is the constant of integration. Its value can be obtained using the initial condition that at t = 0,N = 


N(0). So, C = In N(0). Therefore, 
InN =-At+InN(0), 


or 
N(t) = N(0) exp(—A t). Eq. (11.11) 


To determine the half-life, note that at ¢ = T,/2, we have N = N(0)/2, so that 


1 
3 = exp(—A T1/2)s 


or 
hp=>- Eq. (11.12) 
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Thus, starting from the original number N (0), the number of remaining radioactive nuclides after each 


ae Ta N(0) N(O) N(O) 
successive time interval of T1/2 ae are ee and so on. 


Example: Successive Radioactivity. A radioactive nuclide A decays into a daughter nuclide B with a decay 
constant of A,. The daughter nuclide B itself is radioactive and it decays into a stable nuclide C with a decay 


constant of Ag. Schematically, the nuclear reaction can be expressed as 


a A 
ASB — 6 


If at the beginning, that is, at £ = 0, the number of nuclides is, N,(0) = Ny and N,(0) = 0, determine the 


number of nuclides later at time t. 


Solution: Following the previous example, since the parent A decays only into the daughter B, the first order 


differential equation for N,(t) and its solution are 


eS Ne Eq. (11.13a) 
dt 
and 
N,(t) = No exp(—A, t). Eq.(11.13b) 


The daughter B is produced by decay of parent A and is lost by its own decay into C. So, the first order 


differential equation for Np(t) is 


aNpz(t) 
dt 


= A, Ny —Ag Np. Eq. (11.14) 


The first term on the right-hand side represents the rate of production of B and the second term represents the 


rate of decay of B. Substituting N,(t) from Eq. (11.13b) into the right-hand side of Eq. (11.14), we get 


aNp(t) 
dt 


+ Ag Np = 2,4 No exp(—A, t). Eq. (11.15) 


This differential equation is of the same form as Eq. (11.5b). In this case, the integrating factor is exp(Ap, t) and 


the solution of Eq. (11.15) is same as in Eq. (11.6), namely, 
Aa No 
Np(t) exp(Ag t) = Ay No [ expe —A,)t] dt+C= in exp[(Ag —A,)t] +C, 
BAA 
where C is the constant of integration. Using the initial condition for N,(t) at t = 0, 
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so that 


= [exp(—A, t) — exp(—A, t)]. Eq. (11.16) 
A 


A 
Na() = 7 


Eq. (11.16) provides the number of daughter nuclides as a function of time. On comparing the half-life of parent 
A with the half-life of daughter B, we arrive at two conclusions. First, if the half-life of the parent is much larger 
than the half-life of the daughter, it implies, using Eq. (11.12), that 2, is much smaller than Ag. Then, Ap — A, © 


Ag and exp(—A, t) is much larger than exp(—A, t), so that 


As No 
Np(t) = 7 exp(—A, f). 
B 
Using Eq. (11.13b), it means that 
aN, adNz 
A,N,(t) = ApNp(t) or a = mares Eq. (11.17) 


So, after a sufficiently long time, both the parent and daughter nuclides disintegrate at the same rate. Second, 
we conclude that if the half-life of the parent is much shorter than the half-life of the daughter, no equilibrium 
can be reached since the parent nuclide will disintegrate quickly, leaving behind a much longer-lived daughter 


nuclide. 
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PROBLEMS FOR CHAPTER 11 


1. Determine the function u(t) satisfying the differential equation 


Ce ae a 
dt cosu 
andu(O) = 7/2. 
2. Determine the function y(x) satisfying 
dy _ly 
dx 2% 


with y(4) = 1. 


3. Ona calculus exam, students were asked to use the product rule to determine the derivative of a function 


F(x) = f(x) g(x). A student forgot the product rule but mistakenly assumed that product rule was, 


dF _ df(x) dg(x) 


dx dx dx 


Luckily, the student got the correct answer. 


If one of the functions in the product was, g(x) = exp(3x), then set up aa first-order differential equation for 


the other function f(x). 
Solve the differential equation to obtain the second function f (x). 


4. Biomedical Physics Application. A glucose solution is administered intravenously into the bloodstream at a 
constant rate r. As the glucose is added, it is converted into other substances and removed from the 
bloodstream at a rate that is proportional to the concentration at that time. Thus, a model for the concentration 


C(t) of the glucose solution in the bloodstream is 


where k is a positive constant. 
(a) Determine the concentration at any time t if the initial concentration at t = 0 is Co. 


(b) Determine the value of the concentration at sufficiently long time (t — 0) after the administration 


begins. How will this value change if C, is doubled? 


5. Biomedical Physics Application. A device called a pacemaker can be implanted inside the human body to 


monitor the heart activity. The pacemaker is essentially a capacitor, with capacitance C, that stores some charge 
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at a certain voltage V. If the heart skips a beat, the pacemaker releases the stored energy through a lead wire of 
resistance R to get the heart back to beating normally. In a pacemaker circuit, the charge, Q(t), on the plates of 


the capacitor varies with time as 
Q 
R—+ —=Vexp(-At) 


whered = 2/(RC). If att = 0,Q(t = 0) = CV, solve the above differential equation to obtain Q(t) asa 


function of time, t. 


6. Biomedical Physics Application. On a clear wintry night, at midnight, when the outdoor temperature is 0°C, 
the body of an old man was discovered on a park bench. The temperature of the body at the time of discovery 
was determined to be 28°C. The body temperature of a normal healthy adult is 37°C. It took an hour to take 
the body from the park to the nearby hospital. At 1:00 AM, the pathologist doing the autopsy finds the 
temperature of the body to be 24°C and declares the cause of death to be hypothermia. Based on this 


information, determine the time of death for the old man. 


7. Biomedical Physics Application. In medical research, radioactive materials are helpful in identifying and 
treating diseases in a noninvasive manner. In a chain of successive radioactivity, a radioactive nuclide A decays 
into a daughter nuclide B and the daughter nuclide B decays into a stable nuclide C. The decay constants (or 
half-lives) of both the parent nuclide and the daughter nuclide are equal. If at ¢ = 0 the number of nuclides is 
N,(0) = N, and N,(0) = 0, then determine the time when the population of the daughter nuclide, B, is 


largest. Express this time in terms of the half-life of either nuclide. 
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Chapter 12: Diffusion Equation 


Let us start our discussion of diffusion through some everyday observations. Imagine placing a sugar cube in a 
glass of water. After a few hours, you notice that sugar cube is no longer there; it has become part of the water 
and now the water tastes sweet. The sugar cube has dissolved due to diffusion. This process is similar to the 
process of gas exchange in multicellular organisms. Other examples of diffusion include noticing the smell of 
spray perfume or cigarette smoke even when the person using perfume or smoking is far away from the 


observer. 
12.1 FICK’S LAW 


The process of diffusion refers to the movement of a substance, or more accurately its atoms and molecules, 
from a region of higher concentration to a region of lower concentration. Diffusion is important in biology in 
transporting biomolecules from one location to another. Thermal energy can spontaneously move the molecules, 
and this spontaneous motion is governed by the diffusion equation. Diffusion plays a role in cellular transport 
and membrane permeability, and in determining conformation of certain large biomolecules, along with other 
applications. The rate of enzymatic reaction in many cases is also diffusion-limited as it determines the time it 


will take the reactant molecules to come closer and interact with each other. 


To begin with, let us consider the flow of diffusing particles back and forth in one dimension (say, along the x- 
axis) across a surface S during a time t. The number of particles flowing across is directly proportional to time t 
and to area S. We define net flux, J,, as the number of particles flowing per unit area per unit time. It was 
empirically noted by Fick that the net flux is proportional to the rate at which the concentration of particles 


varies with distance (that is, the gradient of the concentration). In other words, Fick’s law states 


where C is the concentration (that is, number of particles per unit volume). The negative sign is indicative of the 
direction of the flow of particles. The direction of flux, J,, is the direction of increasing x and of decreasing 
concentration, C. The constant of proportionality D is called the diffusion constant and it has dimensions of 


[L? /T]. Similarly, the flow of particles along y and z directions satisfy 


__ pat 
Jy ay’ 
_ ac 
Je Oz 
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So, in three dimensions, we can write Fick’s law as 


J=—-DVC. Eq. (12.1) 
Equation of Continuity 
In order to study the diffusion of particles in three-dimensional space, let us break the whole space into tiny 


boxes. Over time, as particles in the whole space diffuse from one box to another, the total number of particles 


in the whole space remains fixed. In other words, 


N= C(x, y, z, t)dxdydz Eq. (12.2) 


whole space 


is constant, independent of time. 


J (x+ ax) 


Figure 12.1. Particles diffusing into and out of a box. 


Let us consider flow of particles through a box of sides Ax, Ay and Az between times t and t + At. The volume of 
this box is AV = AxAyAz. The concentration of particles in the box at time t is C(x, y, z, t) and the number of 
particles in the box at time t is n(t) = C(x, y, z, t)AV. Similarly, the concentration of particles in the box at a 
later time t + At is C(x, y,z,t + At) and the number of particles in the box at later time t + At is n(t + At) = 


C(x, y, z,t + At)AV. Now using the definition of flux as the particles flowing per unit area per unit time, we get 


[ex + Ax) — Jy(x)]AthyAz+[Jy(y + Ay) — Jy) 1Ataxhz+ [J,(z + Az) — J, (z)|athxdy 
= —{n(t + At) — n(t)}. 


The negative sign on the right-hand side signifies that if more particles diffuse out of the box than diffuse into the 
box, then the number of particles in the box at the later time will be less than at the earlier time. Similarly, if 
more particles diffuse into the box than out of the box, then the number of particles in the box at the later time 


will be more than at the earlier time. Now, keeping only first-order terms in the difference of fluxes, we get 
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(*) Ax| AtAyAz + (2) ay| AtAxAz + (32) az| AtAxAy = —{C(x,y, z,t + At) — C(x, y,z, t)}AV. 


On dividing both sides by At and AV, we get 


OJ, Ay Oz _ _ Cy, 2,t + At) — C(x, y, z, t) [ OC 


Ox Oy Oz At at’ 
or 
V-J= us Eq. (12.3 
ieee q. (12.3) 


This is the equation of continuity. Physically, it represents the conservation of total number of particles, namely 


N, independent of time, as in Eq (12.2). 
12.2 DIFFUSION EQUATION IN THREE DIMENSIONS 


Combining the equation of continuity with Fick’s law gives the diffusion equation, 


a2c a2C a2C] AC 
= Eq. (12.4) 


DWC =D |-—~+—>+—>|=—. 
I Oy? dz? ot 
This is a time-dependent second order partial differential equation. Solving such an equation is beyond the scope 


of this book. The solution of the diffusion equation provides the time dependence of the concentration of the 


particles at any point in space. 
Solution of Diffusion Equation 


The solution of the diffusion equation, after solving the second order partial differential equation, is 


CCe i, 2) = 


Ree dk ab oe 
eee Eq. (12.5a) 


- 
————~ exp 
(V4nDt) 4Dt 
Let us verify that this indeed is the solution of the diffusion equation. The first and second derivatives of C with 


respect to x are 


aC N x? + y?2 42? x 
i ae | ae | Oa 


aC N +y? +2? x \2 1 
a Geet 4D (am) +ent~l(-zp9)} 


207 


or, with r? = x? + y? 427, 


a7c N aa x7 1 

0x2 (V4nDt) XP) 4Dt (2Dt)2 _2Dt}’ 
Similarly, the second derivatives with respect to variables y and z are 

a7C N r? y? 1 

ae et Se eee oa 

Oy? (V4nDt) 4Dt||(2Dt)* 2Dt 
and 

a7C N r? z 1 

aaa ee al oes Se: 

Oz? (V4nDt) 4Dt||(2Dt)* 2Dt 
Thus, combining all the derivatives together, 


wipe N a ee 3 
~ (Vanbey | ADE) (De? 2De) 


Also, the derivative of C with respect to time, t, is 


a = exp|-o N fe r? 
ae | 4De| (ae) IW Pe 20D 


_ N r? r? 3 

7 (V4nDt)’ “XP! apt| |4De2 _ 2t" 
On placing all these derivatives in the diffusion equation, Eq. (12.4), we note that the equation is indeed satisfied. 
Notice that the concentration C(x, y, z, t) of diffusing particles, Eq. (12.5a), is a spherically symmetric function, 
namely, it can be rewritten as 


CGD) = Eq. (12.5b) 


N Te | 
————~ exp |— —_]. 
3 
(V4nDt) aoe 
Here r is the distance from the point of release, or the source point, of diffusing particles. So, starting at the 
source, the diffusing particles spread out in a radial manner if there are no extenuating conditions. As time t 
increases, the spreading out of the particles increases the spatial extent of diffusion. However, the total number 


of particles in the whole space stays the same, namely, N. Explicitly, 
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I | C(x,y,z,t) dxdyd | CO ey ea i | r | 2d 
= X,y,Z,t) axayaz = r,t) 4nr¢dr = ————, | exp|———| r2dr. 
(V4nDt)’ J ADt 


whole space whole space 


Make a change of variables from r to u = r/V4Dt, so that 


4nN af 4nN\Va 
I = ——,, (4Dt)2 | exp (—u?) u? du = —,;-—- = N, 
(V4iDt) J nz 4 


independent of time, t. In the last step we used the integral C [see, Eq. (2.16)]. The quantity V4 Dt is called the 


diffusion length, Lp. 
12.3 DIFFUSION EQUATION IN ONE DIMENSION 


For diffusion of particles in one dimension only (say, along the x-axis), the function describing the concentration 


of particles (that is, particles per unit length) at location x, at time t, is 


2 


exp - =| = C(0,t) exp | 


2 


Lal 


N 
Vit Ly 


C(x,t) = Eq. (12.6) 


This is a bell-shaped Gaussian function that we encountered previously in Figure 4.5 and its related discussion. 


N 


TELp’ Just as in the 


The Gaussian function of Eq. (12.6) has its maximum, central, value of C,,,,(t) = C(0,t) = 


three-dimensional case, the total number of diffusing particles N does not change with time for the one- 


dimensional case of Gaussian concentration. Explicitly, 


2 


fe ye { an 
x, x = ——__ | exp|——]| dx, 
: Vily J 


or, on changing the variables from x tou = x/Lp, 


r Nf N 
J C(x, t) noe | exp[—u?] a= tN, 


independent of time, t. The maximum value of the concentration function C,,,,, is a function of time and it 
decreases with time. So, to keep N fixed the concentration spreads out in space. The extent of values of x, 
namely Ax, over which the function C(x, t) is appreciable is given, at any time, by the full width of the function 


at half of its maximum value (FWHM). Thus, 


Ax = FWHM = 2VIn2 Lp. Eq. (12.7) 
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Note Lp increases with time; that is, the extent to which certain particles diffuse in time t is proportional to Vt. 


Also, the time dependence of the concentration 


comes through the time dependence of the diffusion length Lp = V4Dt. For t > 0, the concentration of the 


diffusing particles is represented by a Dirac Delta function; that is, C(x, 0) = N d(x), using Eq. (I. 5b). 
12.4 FINAL COMMENTS 


We note in passing that the diffusion equation described here is same as the heat equation in thermal physics. 
That is not surprising since the heat equation describes diffusion of heat in a solid. In the thermal case, the 
concentration C(r,t) is replaced by temperature, T(r, t), at any point; thermal energy diffuses from a region of 


higher temperature to a region of lower temperature. 


It is worth pointing out that the process of diffusion is also very important in industrial settings, such as in 
metallurgy. For example, to convert ordinary iron into steel we must add carbon to it, a process called 
carburizing, which hardens the surface of iron. Parts of iron in a high temperature furnace are placed in contact 
with carbon-rich gases and carbon diffuses into the parts, following the diffusion equation, turning iron into 
hardened steel. More generally, the process of diffusion coating, in which some elements are diffused onto the 
surface of metals to improve their hardness and corrosion properties, has been used in nonstick Teflon coatings 


of kitchen pots and pans. 
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PROBLEMS FOR CHAPTER 12 


1. Consider a diffusing gas whose source is located at r = 0 and its diffusion constant D is 4 x 10~* cm?/s. At 
t = 0, the source spews out 1000 particles localized in the air. Plot the diffusion length Lp as a function of time 


t. What is the rate of growth of the horizontal area (A = 717.) filled by the diffusing particles? 


2. Again, consider the diffusing gas of 1000 particles and the diffusion constant of D = 4 x 10~* cm?/s. For 
diffusion in one dimension and the source located at x = 0, plot the concentration C(x, t) as a function of x for 


t = 1s,4s and9 s. What is the area under the C(x, t) versus t curve for each value of time t? 


3. Biomedical Physics Application. A truck carrying hydrogen fluoride, a toxic gas, gets into an accident on the 
road, causing leakage of about 5,000 molecules of the gas. If the diffusion constant of hydrogen fluoride is 
1.7 X 10-5 cm? /s, determine the concentrations of the toxic molecules at the site of the accident 6, 12, 18, and 


24 hours after the accident. 


4. Biomedical Physics Application. A researcher in a virology laboratory is working with a deadly virus ina 
sealed test tube. The laboratory is in the shape of a cube of side length 12 m. The researcher is at the center of 
the floor of the laboratory when the test tube is accidently dropped, spilling the deadly virus. As a result, the 
laboratory will need to be sealed and later clinically cleaned. If the diffusion constant of the virus is 


9 X 10-3 m?/s, how long will it take the virus to spread out to cover the whole floor of the laboratory? 
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Chapter 13: Probability Distribution Functions 


In various branches of science, we often deal with physical variables, which are measurable and controllable 
quantities such as momentum and kinetic energy of a particle or rate of a chemical reaction, etc. The values of 
physical variables are determined by some physical or chemical property such as mass of the moving particle, 
temperature and concentration of chemicals, and others. On the other hand, a variable whose value is 
determined by some random phenomenon is called a random variable. Examples of random variables are the 
number of babies delivered in a hospital per day or the periods of rotation of various planets around the Sun, 
and so on. These quantities are measurable but not controllable. Typically, the outcome of an experiment is an 
event or number which could belong to either a set of discrete possibilities or to a continuous range of possible 
values. Flipping a coin or rolling a die are examples of experiments in which the outcome is discrete. On the other 
hand, observing the change in outdoor temperature in a day from morning to evening or measuring the speed of 
an accelerating car starting from rest leads to continuous outcomes. A probability distribution function is a 
mathematical way of associating the different outcomes in an experiment with the probability with which they 


occur. In this chapter we will learn about discrete as well as continuous probability distribution functions. 
13.1 A DISCRETE PROBABILITY DISTRIBUTION FUNCTION 


A pharmaceutical company assembles a group of twenty volunteers to test its new vaccine. Before starting the 
testing, the pharmaceutical company needs to get the vital signs of all volunteers. These include the height, 
weight, body temperature, and pulse rate of each volunteer. The heights of individual persons are, in inches, 60, 
61, 62, 62, 64, 64, 64, 65, 65, 65, 65, 65, 65, 66, 66, 66, 68, 68, 69, and 70. To get the average or mean height, we 
simply add all the heights and divide by the number of persons, as 

i _ 60+ 61462 +++ 70 _ 1300 


0 a0F = 65 inches. 


[Note in passing that a variable with a bar on it is used to indicate the average or mean value of that variable.] 


Alternatively, we could organize the persons who have the same height into groups and write 


5 _ 60 + 61 + 2(62) + 3(64) + 6(65) + 3(66) + 2(68) + 69 + 70 


0 = 65 inches. 


If there is a very large number of persons, say N, whose average height we need to find, we will first note that N, 
persons have height h,, N, have height hz, N3 have height h3... and N,, have height h,, such that N, + Ny +++: + 
N, = N. Then, 
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5 a Nal + Note + Nols ++ Nalin 


N 


where, 


=fih, t+ foro + fahg +--+ fuln, Eq. (13.12) 


N; 
a ar is the fraction of persons with height h;. Eq.(13.1b) 


Thus, 


n 
h= >. fit, 
i=1 


(Eq. 13.1c) 


where nis the number of different height groups. If instead of height, we are measuring a different vital sign, 


say, temperature of the volunteer, T, then, 


or pulse rate, P, 


or weight, w, 


The number of groups 7 is different for different variables. In all these cases, the variable [height, temperature, 


weight, etc.] is a random variable and has discrete values (since the number of possible values is, at most, N). 


Also, note )i_, f,=1 in all cases, since all fractions of the sample (which, in this example, is the number of 


volunteers) must add up to 1. This is referred to as the normalization condition. Note that these fractions are 


measures of probabilities. For example, assume that in this cohort consisting of 10 males and 10 females, 14 


volunteers are given the actual vaccine and the remaining six volunteers are given a placebo (a harmless 


medicine with no physiological effects). Males are given 4 placebos and 6 vaccines while the females are given 2 


placebos and 8 vaccines. Then we can sort all the volunteers into four groups as shown in the following table. 
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With placebo With vaccine Total 


Male 4 6 10 
Female 2 8 10 
Total 6 14 20 


The outcome of any measurement on this group is an event or a number which is discrete. Now, if a person is 
randomly chosen from this group, the probability of choosing a person who was given the actual vaccine will be 
14/20 and probability of choosing a person given a placebo will be 6/20. The probability of choosing a male 
volunteer will be 10/20 and probability of choosing a female volunteer will also be 10/20. In general, if we are 
dealing with two or more events, then we need to find the joint probability that these events will occur 
simultaneously. If P(X) is the probability of occurrence of event X and P(Y) is the probability of occurrence of 
event Y, then joint probability of occurrence of both events X and Y simultaneously, if they are independent 


events, is 


P(X and Y) = P(X) P(Y). Eq. (13.2a) 


On the other hand, if the two events are not independent and their outcomes are related by some conditions, 
then we talk about conditional probability. By definition, P(X|Y) is the probability of occurrence of event X, 
given that event Y has already occurred. Similarly, P(Y|X) is the probability of occurrence of event Y, given that 


event X has already occurred. The rules for finding the conditional probabilities are 


P(X and Y) P(X and Y) 


P(XIY) = 0 and P(Y|X) = ea 


Eq. (13.2b) 


If X and Y are independent events, it means that occurrence of event X has no effect on the occurrence of event 
Y and vice versa and, therefore, P(X|Y) = P(X) and P(Y|X) = P(Y). Then, Eqs. (13.2b) reduce to Eq. (13.2a) for 


independent events. In case of conditional probabilities, it follows from Eq. (13.2b) that 


P(X) P(Y|X) = P(Y) P(|Y), Eq. (13.2c) 
which is called Bayes’ theorem in statistics. This theorem provides the probability of an event, based on prior 
circumstances that might affect the event. Going back to the example of cohort with 20 volunteers, assume that 
event X is choosing a person with the vaccine and event Y is choosing a female person. Then P(X) is - and P(Y) 
is = Also from the table, P(X and Y) is =. Since P(X) multiplied by P(Y) is not equal to P(X and Y), it follows 


that events X and Y are not independent and, therefore, it is necessary to calculate the conditional probability. 


The probability of choosing a person with a vaccine, given that a female person has already been chosen, is 
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8/20 


P(X|Y) = ra 0.80. Similarly, the probability of choosing a female person, given that a vaccinated person 
has already been chosen, is P(Y|X) = En = = 0.57. It is easy to verify that Bayes’ theorem is satisfied in this 


case. 


If we have the average value x of a random variable x in a sample size of N, it does not provide any information 
about the spread of the values of x. To measure the spread of values, we define the variance and 

standard deviation of x. To quantify the spread of values of the variable, we measure the deviation of each 
data point from the mean value, namely, x; — x. The value of this deviation, for any individual data point, can be 
either positive or negative, and the simple sum of all of these deviations will be zero. That is so because, by 


definition of the average, 


So, to define a variance and a standard deviation, a, we first find the average value of the squares of deviations 


over all data points. This is called variance and its square root is the standard deviation. Explicitly, 


variance, a” = Vit filxi — *)? = Lika fix? — 28 Diy fix + [X1? Dia fi 


Eq. (13.3 
=ha oe el ee ease 


and 


standard deviation, o = |x? — [x]?. Eq. (13.3b) 


Roughly speaking, variance is an overall measure of how spread out the x; values are from their mean value x, 
while standard deviation is used when considering how an individual data point differs from the mean. In 


particular, the standard deviation is used when determining the statistical significance of an experimental result. 
13.2 BINOMIAL DISTRIBUTION FUNCTION 


Consider an experiment that has only two possible outcomes, which are mutually exclusive and can be thought 
of as a success or a failure. Examples of such experiments are tossing a coin or rolling a die. In case of coin- 
tossing, if getting a head is a success, then the probability of success is % and probability of failure is also %. 
While in rolling a die, if getting a 4 is a success, then probability of success is 1/6 and probability of failure is 5/6. 
The binomial distribution provides the probability of observing n successes in N independent trials if the 
probability of success in a single trial is p. The success probability p is the same for all trials. The binomial 


distribution is 
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P(n,N,p) = pyre, Eq. (13.4) 


ae 
Ae a 


Example: In the maternity ward of a hospital, young moms can deliver either a boy or a girl with equal 
probability. On a particular day, six pregnant women are admitted in the hospital. What is the probability that 
only one boy will be delivered on this day? Or, two boys will be delivered? Or, three boys will be delivered? Or, 
four boys will be delivered? Or, five boys will be delivered? Finally, what is the probability that the six babies 


delivered on this day will be either all boys or all girls? 


Solution: On this particular day six experiments are being done in the maternity ward and each experiment has 
two possible outcomes, either a boy or a girl. In this case N = 6. Assuming that delivering a boy is a success for 
the experiment (no sexism intended!), then p = %. Using the binomial distribution, the probability of delivering 


only one boy (that is, n = 1) is 


P(1,6, 0.5) = eos) (5) eer 


The probability of delivering two boys (that is, n = 2) is 


Bee 0S) ae) (3) = = 


Similarly, probabilities of delivering three or four or five boys are 
P(3,6,0.5 ae P(4,6,0.5) = d P(5,6,05)=~-=— 
(3,6,0.5) == 75, (4,6, 0.5) == an (5,6, 0.5) = =F = 35. 
Finally, the probabilities of delivering all six boys (n = 6) or all six girls (n = 0) are 
1 1 
P(6,6,0.5) = a and P(0,6,0.5) = — 


64 


Note that the sum of probabilities of all possibilities adds up to 1, that is 


6 
>. P(n,6,0.5) = 1, 
n=0 


which is the normalization condition. 
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13.3 POISSON DISTRIBUTION FUNCTION 


Let us, once again, consider an experiment which has two possible outcomes, but now we do not know what the 


probability of success, p, is in a single trial. If we do this experiment for a large number of times (NV — 009), then 


we can find an average rate of success. For example, if we toss a coin millions of times, we will infer that average 


rate of getting a head is %. So, instead of dealing with p, we deal with average rate of success, namely, Np = A. 


So, we replace p by A/N and take the limit N — 09 in the binomial distribution to get 


n 


P(n,a) = li N! (=) (: -) 
MA) = ye nl (N =n)! WW. N 
To evaluate this limiting expression, we look at it part-by-part. For example, 


fin ME op NED Want (M) (N= 1) (N=n 41 
Now (N—n)IN® Now ig = Jim ( )( jet ) 
and 


—n 


im (1-4) =1 
pa ee 


Also, in precalculus, Euler’s number, e, is introduced as the limit of (1 + 1/N)% as N approaches infinity. The 


value of this constant is e = 2.71828. So, 


N 


lim (1 a ee 
Jim ( a 7 
Thus, 


n 


P(n,d) = ~ exp(-A) ; Eq. (13.5) 


which is known as the Poisson distribution. The Poisson distribution is appropriate if the average rate at which 


certain identical events occur during a time period is known. Furthermore, all events occur independently; that 


is, occurrence of one event does not affect the probability of occurrence of the next event. The Poisson 


distribution provides the probability that n events occur during a certain time period where n = 0, 1, 2,-:-. Note 


in passing, 


> P(n,A) = exp(+A) exp(—A) = 1, 
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which is the normalization condition. 


Example: In several parts of the world, sightings of tornadoes are a common occurrence. According to the 
historical records of the National Weather Service of the USA, in the state of Michigan the average number of 
tornadoes is 15 per year, with the largest monthly average of three tornadoes occurring in June. With this 
information, what is the probability that there will be exactly one or two or three or four tornadoes in 


Michigan next year in June? 


Solution: Since the average rate of tornadoes in June is three, A = 3. The probability of n tornadoes in June is 


n 


3 
P(n,3) = ai CxP(-3) - 


Thus, probabilities of having one or two or three or four tornadoes next June are 


1 


3 
P(1,3) = F,exp(-3) = 0.149, 


2 


3 
P(2,3) = S-exp(-3) = 0.224, 


3 


3 
P(3,3) = Zexp(-3) = 0.224, 


4 


3 
P(4,3) = Frexp(—3) = 0.168. 


13.4 MAXWELL-BOLTZMANN (CONTINUOUS) DISTRIBUTION FUNCTION 


Now suppose we wish to find the average speed of molecules in a room. It will be extremely difficult to do since 
the number of molecules is immeasurably large and difficult to count. Even if we only wish to find the fraction of 
molecules with a particular speed v;, their number will be enormously large. In this case, we break the whole 
possible range of speeds into small groups or intervals Av and concentrate on fraction of molecules with speeds 
between v; and v; + Av. Then, the fraction of molecules in this interval is f(v;)Av. The function f(v;) is called a 


distribution function or probability density function (PDF). In this case, 
n 
v= >. vif wodv, 


t=1 


and, if Av — 0, the variable v becomes a continuous random variable, 


218 


SI 
Il 


vf (v) dv. Eq. (13.6a) 


whole range of v 


Also, since all fractions must add up to 1, 


f(v) dv =1. Eq. (13.6b) 


whole range of v 


This is the normalization condition for the distribution function. This is equivalent to the notion that we can be 


100% certain that an observation will fall into the range of all possible outcomes. 


The velocity distribution of molecules of mass m in a gas at temperature T is the Maxwell-Boltzmann 


distribution. It is a continuous distribution function given by 


3/2 2 
fv) = 4nv? — ) ex (—Fr), Eq. (13.7) 


where k is the Boltzmann constant. We note 


‘a r 5 
[ rooms = a(S (= sat) [ e(-3 =) % dv. 


To evaluate this integral, we make a change of variables. Let u? = sat oru= [= v and du = [= dv. 


Then, 
r m 3/2 (2kTy\3/? Ni 
[ fe@av = 4n == (—) [ ex u’) u2du = 4n — Pag = i, Eq. (13.8a) 
0 0 


as expected, since this is the normalization condition. Here we used the integral iG from Eq. (2.13). We can also 


find average values of v and of v2 as 


00 ‘i [oe] 

a = 3 

b= | vf(v)dv = 4n( (= san) [ e(- sar)’ dv. 
0 


0 


2 
Again, we make a change of variables. Let t = — and dt = — 2vdv, so that 


1/2 


Fe (ee ee ea t)tdt=2 ea [ ede = = Eq. (13.8b 
a Ga 2\m ean ~ “\int an ~ \anm ; CCD) 
(0) 0 
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and 


co 


“i r 
2 — 2 = 4 
v fe v*f(v)dv = Are ( (= sat) [ ex(- vr) dv. 
0 


0 


2 
With the change of variables, u? = —~, 
2kT 
- m \3/2 /2kT\ 5? Sn Be DEB 
v2 = An = (—) | exp(—u?) u*du = a1 Gee at eta a — Eq. (13.8c) 


0 


Note in passing that =mv? = — that is, average kinetic energy per molecule in a gas depends only on the 


temperature of the gas. 
13.5 NORMAL OR GAUSSIAN (CONTINUOUS) DISTRIBUTION FUNCTION 


Another useful continuous probability distribution function is the Gaussian, or normal, distribution function. A 


random variable x has normal distribution if -co < x < 00 and the corresponding probability density function is 


fG)= —ex|-5 ( 5 J]. Eq. (13.9) 


By definition, f (x) dx is the probability that value of variable x lies between x and x + dx. First, we verify the 


normalization condition, namely, i f(x) dx = 1. The integral is 


free = aa! ex 


To evaluate this integral, we make a change of variable from x to u using u = oe orx=ou+tu, and dx = 


o du, 


u? 1 


ere we used t e integra rom cq. : : ext, we Ting average values of X and oT xX using t e same change 
H dthe i | 19, from Eq. (2.14). N find lues of x and of x2 using th h 


of variable from x to u, 


L= [ -Feodx = Jame E =) dx 
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co 


‘bof u? ri i ON oe 
ae [ oo wee S| =0+ T= exp| 7 du =u, Eq. (13.10b) 


and 
— r r 1 1 /x-— py? 
2 = 2F(x)dx = 2____exp|—— d 
x | feo I°aEe 5 (—) | a 
= Se (u2o07 + 2ucu + py”) exp - | du = eee ore +0+p?2V2z} 
v20 7. 2 V2 
or, 
x? =o07 +p. Eq.(13.10c) 
Then, 
variance = x? — [x]? = (0? +p?) — yw? =o? 
and, 


standard deviation = o. Eq. (13.11) 


So, now we can interpret the normalized Gaussian distribution function. If a variable x is normally distributed, 
then f(x) dx is the probability that x lies between x and x + dx. The function f(x) has a mean value of and 


standard deviation of o. If we change variable from x to u = = then, for the normal distribution, 
f (x) dx = F(u) du, 


where 


1 u? 
F(u) = a? (- 5) : 


We note that Fw) is the same Gaussian function that we encountered in chapters 4 and 12. It is worth observing 
that F(w) is a symmetric function of u, that is, F(—w) = F(u). Also, F(u) is a normalized function, that is, 
‘a F(u) du = 1. The quantity F(u) du is the probability that u lies between u and u + du. The probability that 


u lies between a and b is 
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For given a and b, the numerical value of this integral is obtained from the tabulated values of the Gaussian 


function, (x), defined as 


x 


@(x) = Ge du = — | exp (- ) du. Eq. (13.12) 


The tabulated numerical values of (x), as a function of x, are given in Appendix D, in which the shaded area 
under the Gaussian curve is the value of ®(x). Note ®(co) = 1, (0) = =, @(—c) = 0. Also, due to the 


symmetric nature of F (1), 


@(-x) = | Fu) du = — [ P(-v) dv = [ FC) av = [ F@av- [ roa 
@(—x) = 1-— (x). Eq. (13.13) 


This relationship allows us to find ® for negative values of x in terms of ® for positive values of x. So, in the 


standard table of Appendix D, the function ®(x) is tabulated for positive values of x only. Also, if b > a, then 


; Aas 1 ° u? 5 
Ge) Er en(-5) u 
b a 
Eels nae -— | ae) Eq. (13.14) 
oe 5 u Tea) 5 uUu= a). q. (13. 


If the range of value of x is from x, to xq (x, < Xx < Xq) instead of from —oco to o0, then 


1 a - = ay f <x< 
f@)= Ne Silas a Eq. (13.15a) 
oN 0 otherwise 


This is referred to as the truncated normal distribution. The scaling constant N is determined by the fact that 


ties f(x)dx = 1, or, 
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Making a change of variable from x to u = — 


2 


1=v— | ~~) du = N[e(u,) - &(u,)] 
= Vin 5 exp( ) = (uy) — P(u,)], 


or 


1 


Example: The grade point averages (GPA) of a large population of students at a university are normally 


distributed with a mean of 2.4 and a standard deviation of 0.8. The students with GPAs higher than 3.3 receive 


honors credit while students with GPAs less than 2.0 do not receive the degree. 


What percentage of students receive honors credit? 


What percentage of students can never graduate? 


Solution: Let x be GPA of students, 0 <x < 4. 


Xy-eb 4-24 


xy = 4.0,uy = = — 08 = 2.0, 
X,-H 0-24 

x, =0.0,u, = a a 5g = 730, 

1 1 1 


N= $20) -8(-30) 62) —[1- 830) H2) +030) —1 


1 1 1 
~ 0.9772 +0.9987-—1  1.9759-1 0.9759’ 


(a) 


oO 


4.0 4.0 
PapeenEneorh tudent ee ee | (—*) | 
ercentage O onors stuaents = x)aAX = —. ex =—— Xx. 
: 3.3 a 2n J . 


4.0-2.4 


3.3-2.4 9 ence 
= 5, upper limit of u = oa 2. 


0.8 


Letu= —* then lower limit of u = 


Z 
peice | | sl 
ercentage = —— ex = Uu 

5 Van J. ie 
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9 
_— P2)-@ (3) _ 0.9772 — 0.8686 _ 0.1086 


= — NBT 8 EE = = 0.11 oF 11%. 
B(2) — B(-3) 0.9759 0.9759 rane 


(b) 


2.0 


2.0 
Percentage of failing student [ Food v— | 5 (—) | 
ercentage of failing students = x)dx = N——]| exp|--~ x 
g g J avon | p ag 
Letu= ~ lower limit = 22 = —3, upper limit = BOSE fae) 6. 
o 0.8 0.8 
-0.5 


Barcent n— | | sl 
ercentage = —— ex =——— U 
g a J p|-5 


= N[@(-0.5) — &(-3)] = N[1 — (0.5) — 1 + &(3)] = N[®(3) — &(0.5)] 


_ 0.9987 — 0.6915 _ 0.3072 


= —— WAsEO, 2 = Moco = 0, 
0.9759 0.9759 0.31 or 31%. 


Example: The U.S. Centers for Disease Control and Prevention has reported several cases of Salmonella 
outbreak linked to guinea pigs. A veterinary research physician sets up a trap to catch a large number of 
guinea pigs. If the mean width of guinea pig’s shoulder is 4 = 3.8 inches with a standard deviation of 0.6 inch, 


and if the door of the trap is 5 inches wide, what percentage of guinea pigs will pass through the door? 


Solution: Let x be the shoulder width of a guinea pig, so that 0 < x < 00. The percentage of guinea pigs with 


shoulder length less than 5 inches is 


1 


5.0 1x — py? 
i == Jo pa =3( o ) Jax 


| 
fail C=) 


Percentage = 


Make a change of variables from x to u = a 


5-3.8 
0.6 


0-3.8 


Forx =0,u =0oo.Forx =0,u= =—Forx = 5.0,u= = 2.0. 


1 fc. £4 d 19 
V2 °~19/3 as 2104 -@OQ)= &(- =z) 
Percentage = <a Ce eras 8 
Ta l-1973 exp |-> du &(w) — &(— =z) 
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_ &(2) -[1- 9(6.33)] _ (2) + (6.33) -1 _ 0.9772 +1-1 


“@@)S [t= eels) CO) eeesI=1° Asia. 


or 97.7% guinea pigs will fit through the door. 
13.6 A FINAL REMARK 


As a final remark, it is worth pointing out that concepts related to probability distribution functions are central to 
quantum physics. The initial formulation of quantum physics using matrix methods led to the understanding of 
eigenvalues and eigenfunctions of matrices as representing physical observables. The gist of quantum physics is 
to talk about possibilities and probabilities in the measurement of physical properties, such as energy, 
momentum, angular momentum, and so on, at the scale of atomic and subatomic particles. The eigenvalues 
represent the possibilities and the eigenfunctions (or, their modulus square) represent the corresponding 


probability distribution functions. 
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PROBLEMS FOR CHAPTER 13 


1. To explain the conduction properties of metals, Paul Drude provided a simple model of a metal as a reservoir 
of free-electron gas. Even though Drude’s model was successful in explaining the classical Ohm’s law, it failed to 
account for the quantum behavior of electrons. The Exclusion Principle in quantum physics asserts that no more 
than two electrons can occupy the same atomic orbital. Classically, particles of a gas at absolute zero 
temperature will all have zero kinetic energy. In quantum physics, all electrons in an electron gas cannot be in 
ground state since that would contradict the Exclusion Principle. At absolute zero temperature, the fraction of 


electrons in a metal with energies between E and E + dE is given by 


CVE dE forO <E<E, 
0 for E > E, 


, 


f(E) dE = { 


where E;, is a constant (called the Fermi energy). 
Determine C if this energy distribution is normalized to 1. 
Find the average energy of electrons in terms of E;. 


2. A one-dimensional classical harmonic oscillator of mass m and frequency w is oscillating along the x-axis with 
its equilibrium point at x = 0. The turning points of the classical oscillator at x = +x, refer to the values of x at 
which the oscillator momentarily comes to rest and reverses its direction of motion. In other words, x, is the 


largest displacement or the amplitude of the classical oscillator. The total energy E of the classical oscillator is 


2 


conserved and its value at the classical turning points is E = 5 mw? xé. In quantum physics, the one-dimensional 


harmonic oscillator can be found anywhere along the x-axis from x = —co to x = +00. The probability of 


finding the quantum oscillator between x and x + dx is 
P(x) dx = C exp(—x?/x?) dx. 
Determine C if this probability distribution function is normalized to 1. 
Find the probability of locating the quantum oscillator outside the classical turning points. 


3. The simplest of all atoms is the hydrogen atom consisting of a proton and an electron. The proton is in the 
nucleus and the electron can be anywhere in the space surrounding the nucleus, with its position given by a 
distribution function. The probability of finding the electron at a distance between r and r + dr from the 


nucleus is 
P(r) dr = Cr? exp(—2r/a,) dr, 


where a, is a constant (called the Bohr radius). Here, 7 can take any value between 0 and oo. 
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Determine C if this probability distribution function is normalized to 1. 
Using this probability distribution function, determine the average value 7 of r. 


4. Biomedical Physics Application. The gestation period of human pregnancies has a normal distribution with a 


mean pt of 268 days and standard deviation o of 16 days. 
What percentage of pregnancies last between 260 and 284 days? 


The colloquial term “preemies” refers to infants who have preterm birth with gestational age of less than 37 


weeks, or 259 days. What percentage of infants are preemies? 


Post-term babies are born after 284 days of pregnancy. What percentage of pregnancies leads to post-term 


babies? 
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Appendix A: Limiting Value of sin 0/0 as @ — 0. 


Using simple facts from geometry and trigonometry, we can determine the limiting value of sin 0/0 as 0 


approaches 0. 


Consider a circular pie, of radius R, which is cut into slices. One slice of angular size 6 (shown in the left figure) is 
of area A,. Using the concept of proportionalities, ratio of area of slice (A,) and area of full pie (mR?) is same as 

2 
the ratio of angular size of slice (9) and angular size of full circular pie, namely, 27. Thus, “s = = or A, = — 
In the adjoining right figure, we show the same slice of angular size © and label one of its corners as A. From this 
corner A we drop a perpendicular line that meets the opposite side of the slice at point B. Also, at corner A, we 


draw a tangent to the circular pie (of center O) that meets the extension of line OB at point T. From this second 


figure, we note 


Area of triangle OAB < Areaof slice < Area of triangle OAT. 


; (OB)(AB) < A, - (OA)(AT) 


0 R? 
2 


; (Rcos@)(Rsind) < < ; (R)(R tan 6) 


Since cos@ > 1as@ — 0, it follows that 
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sin(@) _ ; 
050 0 = 


In addition, we note that for small values of 6, sin@ ~ @. 


Eq. (A-1) 
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Appendix B: Mnemonic for Maxwell’s Relations of Thermodynamics 


Laws of thermodynamics play fundamental roles in physics, biology, chemistry, chemical engineering, mechanical 
engineering, and other scientific disciplines. In their mathematical descriptions, these laws relate four physical 
variables, pressure (P), volume (V), temperature (T) and entropy (S). Derivatives of these variables are related via 


four Maxwell’s relations of thermodynamics, which are 


P 
Ss 
(=), (=), 
—=" — 


+() =+(5) 
OS)p —- \OP/ 


This appendix presents a simple mnemonic device for recalling these relationships without any complications for 
remembering the correct signs. Note that two of these relations have a positive (+) sign on the right-hand side 
and the other two have a negative (N) sign on the right-hand side. In each relationship, the variable with respect 
to which a partial derivative is taken on one side is kept fixed on the other side. The first two relations with the 
negative sign on the right-hand side can be memorized as follows. Recalling the letter N for the negative 
relations, wrap the four letters PSTV for the thermodynamic variables, in alphabetical order, along this N in two 


possible ways as shown here: 


In these figures, the placement of letters is in the same location/way as in the first two Maxwell’s relations, 


which contain the negative sign on the right-hand side. 
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To get the mnemonic for the other two Maxwell relations—namely, the ones with the positive sign on the right- 
hand side—rotate these figures by 77/2 (does not matter whether the rotation is clockwise or counterclockwise) 


and replace N (negative) by + (positive), to get 


In these figures, the placement of letters is in the same location/way as in the last two Maxwell’s relations, which 


contain the positive sign on the right-hand side. 
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Appendix C: Proof of the Epsilon-Delta Identity 


In the introduction of Levi-Civita symbol, €;;,, in the Interlude chapter, a relationship between €;;, and 


Kronecker delta 6;; was presented as 


>. EijkEim = Oj18km = OjmOxt- 


i 


This epsilon-delta identity can be proved by starting with a determinant of order 3 whose element, a;;, located 


yj 


at the intersection of i“ row and j*" column, is 6;j;- This determinant is 


611 542 443 1 0 0 
A=62, 622 623)=|0 1 O;=1. 


In the Interlude chapter, the value of Levi-Civita symbol, €;;,, was given as +1 (or —1) if an even (or odd) 
permutation of (ijk) gives (123). Thus, on replacing the row numbers (123) of A by (ijk), the value of the 


determinant will become €;;; instead of 1, that is 


61 9:2 Sis 
Oj. Oj2 Oia] = Ein - 
Ok x2 Oks 


Similarly, on replacing the column numbers (123) by (nlm), the value of the determinant will become 


Sin Oi Sim 
din Oj Ojm| = ijnEnim - 
On Oxi Om 


Now, replace n by i, and expand the determinant across first column explicitly, to get 
EijxEitm = 5:1 (Sj Siem = 5jm 5x1) - 5;i(5i1 Sm — OmOnm) ¥ dxi( On djm = Sim 5j1)- 


Finally, sum over i and use );3_, 6;; = 3 to obtain 


> EijxEiim = 3(5;5km a 5jm 5x1) = (5;.5km _ 5jm 5x1) a (Sjm Ste = 5;15km)s 


i 
or, 


Dir Ein liin = Oj Okm ae Ojm Ox 
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Appendix D: Areas under the Standard Normal Curve 


The table below provides the values of the Gaussian function, ®(x), as a function of x. 


P(x) = Tel exp(-u?/2) du 


} 


x ‘ 0.01 002 003 004 005 006 007 0.08 0.09 


RoOm=-oO 


—_— — st ot 
BOM=|-0 OOMNDD BWMH=+-OD OONRMH 


oak eh enh ek 


WWOWW WWWHO NMMNMNYMY NMMYMYD 
oon on ono ooaononu 


Table 13.1. Areas under the standard curve from —o to x. 


